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phase  transformations 
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Abstract.  We  report  results  from  a  systematic  program  of  changing  composition  of  alloys  in  the 
system  TiNiX,  X=  Cu,  Pt,  Pd,  Au,  to  pursue  certain  special  lattice  parameters  that  have  been 
identified  previously  with  low  hysteresis.  We  achieve  A2  =  1,  where  A2  is  the  middle  eigenvalue 
of  the  transformation  strain  matrix,  for  alloys  with  X  =  Pt,  Pd,  Au.  In  all  cases  there  is  a  sharp 
drop  of  the  graph  of  hysteresis  vs.  composition  at  the  composition  where  A2  =  1.  When  the  size 
of  the  hysteresis  is  replotted  vs.  A2  we  obtain  an  universal  graph  for  these  alloys,  which  also  agrees 
with  trends  in  an  earlier  combinatorial  study  of  alloys  in  the  system  TiNiCu.  Motivated  by  these 
experimental  results,  we  present  a  new  theory  for  the  size  of  the  hysteresis  based  on  the  growth 
from  a  small  scale  of  fully  developed  austenite  martensite  needles.  In  this  theory  the  energy  of  the 
transition  layer  plays  a  critical  role.  New  methods  for  calculation  the  optimal  layer  are  developed 
that  rely  on  r~ convergence  arguments,  the  small  parameter  being  |A2  —  1|.  The  limiting  energy  of 
the  transition  layer  is  found  to  be  governed  by  a  nonstandard  linear  elasticity  problem.  Overall, 
the  results  point  to  a  simple  systematic  method  of  achieving  low  hysteresis  and  a  high  degree  of 
reversibility  in  transforming  materials. 
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1  Introduction 

This  paper  concerns  the  hysteresis  that  accompanies  martensitic  phase  transformations.  We  focus 
mainly  on  thermal  hysteresis. 

The  work  reported  here  follows  up  a  conjecture  of  [17]  that  asserts  a  relation  between  the  condi¬ 
tions  of  compatibility  between  two  phases  and  the  hysteresis  seen  during  cyclic  transformation.  The 
conjecture  was  formulated  by  looking  at  the  literature  (references  in  [17])  and  trying  to  understand 
what  is  common  among  alloys  with  particularly  low  hysteresis.  A  special  focus  of  this  literature 
search  was  on  data  from  repeated  experiments  on  the  same  specimen,  restored  to  its  original  shape 
after  each  test  by  heating,  in  a  material  with  minor  training  effects,  so  as  to  factor  out  as  much  as 
possible  processing  conditions.  Three  conditions  of  compatibility  were  conjectured  to  be  relevant  to 
the  minimization  of  hysteresis  and  the  reversibility  of  transformation:  1)  detU  =  1,  where  the  sym¬ 
metric  positive-definite  matrix  U  is  the  transformation  strain  matrix,  2)  A2  =  1,  where  Ai  <  A2  <  A3 
are  the  ordered  eigenvalues  of  U,  and  3)  the  conditions  A2  =  1  and  a-UiCof(Uf  —  I)n)  =  0,  together 
called  the  cofactor  conditions.  In  the  latter  the  vectors  a  and  n  are  certain  vectors  that  describe  the 
twin  system,  as  explained  below  (see  (3)).  The  first  of  these  conditions  summarizes  a  well-accepted 
idea  in  the  community. 

Jun  Cui  et  al.  [8]  noticed  that  the  system  Ti5oNi5o-xCux  for  small  values  of  x  nearly  satisfies  the 
first  two  of  these  conditions,  and  they  measured  over  a  wide  composition  range  both  the  eigenvalues 
of  U  and  the  hysteresis,  using  combinatorial  synthesis  methods.  Their  results  showed  a  strong  cor¬ 
relation  between  the  second  of  these  conditions  (A2  =  1)  and  the  size  of  the  hysteresis.  Surprisingly, 
the  measurements  showed  only  a  weak  correlation  between  the  size  of  the  hysteresis  and  the  volume 
change,  even  though  detU  varied  widely  from  about  0.96  to  1.07,  with  quite  a  few  alloys  near  the 
extremes  of  this  range.  More  surprisingly,  the  alloys  with  det  U  very  near  1  included  some  alloys 
with  the  largest,  and  others  with  the  smallest  hysteresis  among  all  the  alloys  measured.  All  this  is 
surprising  because,  since  the  earliest  days  of  the  study  of  martensite,  the  condition  det  U  ~  1  has 
been  widely  considered  important  for  reversibility  of  the  transformation,  and  elementary  elasticity 
calculations  of  the  stress  field  and  energy  surrounding  an  island  of  martensite  growing  in  a  hole  of 
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austenite  of  a  different  volume  suggests  the  presence  of  a  rather  large  energy  barrier  associated  with 
detU^l. 

The  main  purpose  of  this  paper  is  two- fold:  to  describe  the  results  of  a  program  of  alloy  devel¬ 
opment  in  which  the  composition  of  alloys  was  systematically  tuned  to  pursue  the  relation  A2  =  1, 
and  to  give  a  new  theory  for  the  prediction  of  hysteresis. 

We  are  able  to  achieve  A2  =  1  in  the  systems  TiNiPd,  TiNiPt,  and  TiNiAu,  and  we  exhibit 
alloys  in  each  of  these  systems  with  A2  <  1  (like  TiNi  itself)  and  with  A2  >  1.  There  is  a  sharp  drop 
of  the  hysteresis  for  alloys  with  A2  =  1  with  the  most  dramatic  example  being  Ti5oNi37Aui3,  which 
shows  a  decrease  in  the  size  of  the  hysteresis  of  about  a  factor  of  10.  When  the  size  of  the  hysteresis 
is  plotted  as  a  function  of  composition,  it  has  no  particularly  distinguishing  features,  but,  when 
it  is  replotted  as  a  function  of  A2,  there  is  a  collapse  of  the  data  onto  an  approximately  universal 
curve.  The  combinatorial  data  of  Jun  Cui  et  al.  [8]  also  fall  near  this  curve.  This  graph  has  some 
interesting  features,  including  an  apparent  singularity  at  A2  =  1.  This  also  is  quite  surprising,  as 
hysteresis  could  well  be  expected  to  be  sensitive  to  many  other  physical  parameters  besides  A2,  as 
well  as  processing  conditions. 

To  understand  this  dominance  of  A2,  we  are  led  to  propose  a  new  theory  of  hysteresis.  The 
condition  A2  =  1  is  the  condition  that  there  is  an  exact  interface  between  austenite  and  martensite, 
and  such  an  interface  does  not  contribute  the  usual  bulk  energy  of  its  transition  layer,  nor  its 
interfacial  energy  on  twin  bands.  The  theory  of  hysteresis  we  propose  is  based  on  the  hypothesis  that 
the  main  energy  barrier  leading  to  hysteresis  arises  from  these  contributions.  Thus,  we  propose  that 
the  main  energy  barrier  leading  to  hysteresis  is  the  growth  of  fully  developed  austenite/martensite 
interfaces.  We  implement  this  idea  via  a  nucleation  calculation.  Ours  is  not  the  first  nucleation 
calculation  that  has  been  done  for  martensite,  but,  as  far  as  we  can  determine,  previous  calculations 
have  not  allowed  such  fully  developed  interfaces. 

Recently,  the  interfaces  between  austenite  and  martensite  in  one  of  the  alloys  with  A2  =  1 
in  the  system  TiNiPd  has  been  observed  using  high  resolution  electron  microscopy  by  Delville  et 
al.  [10].  These  observations  confirm  the  presence  of  untwinned,  atomically  sharp  interfaces.  The 
microstructures  observed  are  unusual  as  compared  with  normal  (A2  7^  1)  martensites. 

This  theory  predicts  a  dramatic  sensitivity  of  the  size  of  the  hysteresis  to  A2.  In  particular,  the 
predicted  graph  of  hysteresis  vs.  A2  has  a  singularity  at  A2  =  1,  like  the  measured  data.  The  theory 
predicts  that  this  graph  depends  on  other  physical  parameters  including  the  critical  nucleus  size, 
the  other  eigenvalues  Ai,  A3,  the  interfacial  energy  constant,  an  elastic  modulus,  the  transformation 
temperature  and  the  latent  heat.  However,  for  the  alloys  studied,  the  dependence  on  these  other 
parameters,  to  the  extent  we  can  estimate  them,  is  weak,  consistent  with  the  universality  of  the 
measured  graphs.  The  graph  of  hysteresis  vs.  A2  is  not  predicted  to  be  universal  among  all  other 
alloys  of  other  symmetries  and  properties. 

Our  prediction  is  not  completely  quantitative  because  of  lack  of  knowledge  of  the  interfacial 
energy  constant,  the  critical  nucleus  size,  and  the  fact  that  our  model  of  the  transition  layer  is  not 
optimal.  Because  of  recent  and  ongoing  work  [16,  32,  37,  38,  42,  45]  on  various  new  methods  of 
measuring  interfacial  energy,  progress  is  expected  on  the  former. 

Regarding  the  optimality  of  the  transition  layer,  we  present  a  new  approach  to  the  calculation  of 
the  optimal  layer.  We  want  to  do  this  in  order  to  lay  the  groundwork  for  a  quantitative  calculation 
of  the  hysteresis,  but  also  for  another  reason  suggested  by  this  paper.  That  is,  when  we  use  a 
naive  but  reasonable  calculation  of  the  energy  of  the  layer,  we  find  a  dramatic  dependence  of  the 
energy  of  layer  on  the  twin  system  that  participates  in  the  austenite/martensite  interface,  Table 
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4.  For  the  lattice  parameters  of  Tir)0Ni5o_xPdx,  x  ~  11,  this  energy  varies  almost  two  orders  of 
magnitude,  depending  on  the  twin  system,  even  within  a  given  twin  type.  This  is  quantified  by  a 
certain  geometric  factor.  If  this  property  extends  to  the  optimal  layer,  then  it  could  explain  why 
certain  twin  systems  are  preferred  in  martensitic  phase  transformations. 

Normally,  the  energy  of  the  transition  layer  is  thought  to  depend  on  the  nonlinear  elastic  proper¬ 
ties  of  the  material.  However,  the  elastic  energy  minimization  problem  for  determining  the  structure 
of  the  layer  contains  a  small  parameter  for  these  alloys,  namely  |A2  —  1|.  This  opens  the  way  for 
the  use  of  T-convergence  arguments  to  derive  a  limiting  variational  principle  for  determination  of 
the  elastic  energy  of  the  layer  for  alloys  with  A2  near  1.  We  do  this  in  Section  8.  As  is  typical  in 
such  arguments  the  limiting  variational  principle  has  a  universal  status.  That  is,  the  input  for  the 
calculation  of  the  limiting  energy  involves  only  the  lattice  parameters  of  austenite  and  martensite 
and  the  linear  elastic  moduli.  In  our  derivation  we  consider  only  minimization  of  the  elastic  energy 
of  the  martensite.  Future  work  should  consider  also  relaxation  of  the  austenite  as  well  as  the  pos¬ 
sibility  of  branching  of  the  twins  in  the  martensite  [20,  21,  19].  We  believe  that  our  derivation  lays 
the  groundwork  for  these  further  calculations,  which  can  lead  to  accurate  evaluation  of  the  energy 
in  the  transition  layer.  A  side  benefit  of  such  a  derivation  is  it  will  give  a  way  to  measure  interfacial 
energy  constant,  by  balancing  bulk  and  interfacial  energy  in  the  usual  way  by  studying  alloys  with 
A2  near  1  and  using  the  measured  fineness  of  the  twins  these  alloys. 

The  full  statement  of  the  T-limiting  problem  is  the  problem  of  minimizing  the  following  elastic 
energy, 


hif  2 


(v,i  ®A  Tm  +  (a  +  ^v,2  )  ®  A  Tn)  •  C[v,i  ®A  Tm  +  (a  +  ^v,2  )  ®  A  Tn] 


(dtidt2,  (1) 


over  functions  v  and  parameters  fj  subject  to  the  boundary  conditions, 


v(ti,  0)  ||  a,  v(fi,  1)  ||  a,  (v(H,0)  -  v(H,  1))  •  a  >  0,  -£  <  tx  <  0, 

v(0,  t2)  =  0,  v(— £,  t2)  =  C(1  —t2  +  fj)a,  0  <  t2  <  1.  (2) 


The  notation  is  the  following,  is  a  rectangular  domain  of  length  (',  m  is  the  normal  to  the  habit 
plane  when  the  condition  for  exact  compatibility  (A2  =  1)  is  satisfied,  a  is  the  amplitude  of  the  twin 
and  n  is  its  normal  at  A2  =  1,  A  =  I  +  b®mis  the  deformation  gradient  of  the  martensite  variant 
that  is  compatible  with  austenite  at  A2  =  1,  C  is  the  elasticity  tensor  at  A,  i.e.,  of  the  martensite  at 
the  compatible  variant,  £  is  the  periodicity  (length  scale)  of  the  twin  bands,  and  fj  is  a  length  that 
describes  the  offset  of  the  vanishingly  small  twin  band;  essentially,  it  describes  asymptotic  bending 
of  this  band.  It  is  believed  that,  without  the  rigorous  T-convergence  argument  presented  here,  it 
would  not  be  possible  to  guess  this  detailed  form  of  this  linear  elasticity  problem  for  the  transition 
layer. 

Reversibility,  measured  for  example  by  fatigue  life  under  cyclic  transformation,  is  as  important 
as  low  hysteresis  for  actuator  applications.  The  connection  between  size  of  the  initial  or  stabilized 
hysteresis  loop  and  the  fatigue  life  has  been  investigated  by  Gall  and  Maier  [13],  and  Kato  et  al.  [18], 
and  recently  by  Moumni  et  al.  [31].  All  of  these  authors  show  a  correlation  between  the  hysteresis 
and  fatigue  life.  The  latter  authors  conclude  that  “it  is  shown  that  the  dissipated  energy  of  the 
stabilized  cycle  is  a  relevant  parameter  for  the  estimation  of  lifetime” . 

There  are  many  papers  that  discuss  hysteresis  in  martensites.  These  include  papers  that  propose 
constitutive  equations  for  martensitic  materials  from  which  hysteresis  loops  are  computed  ([43], 
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[24],  [6],  [44],  [26],  [48]),  those  that  discuss  hysteresis  in  martensites  in  the  presence  of  disorder 
([39],  [46]);  papers  that  relate  hysteresis  to  the  attainment  of  a  self-organized  critical  state  [34], 
to  metastability  induced  by  incompatibility  [2],  and  to  pinning  of  interfaces  by  defects  ([22],  [49]). 
Another  line  of  work  models  hysteresis  using  the  Preisach  model  and  its  generalizations  [4],  Not  all 
of  these  theories  are  in  direct  contradiction  with  the  one  proposed  here.  Some  of  these  theories  are 
“micromechanical”  and  contain  the  parameter  A2  that  plays  such  an  important  role  in  this  paper, 
and  thus  these  theories  can  be  implemented  with  A2  near  1  so  as  to  examine  their  consistency  with 
the  measured  hysteresis  given  here. 

The  notation  is  similar  to  that  in  the  book  of  Bhattacharya  [5] .  In  particular  a  ®  n  is  the 
3x3  matrix  with  rectangular  Cartesian  components  ayrq ,  defined  from  two  vectors  a,  n  E  M3. 
SO(3)=  {R  E  R3x3  :  RTR  =  I  ,  detR  =1}  denotes  the  set  of  3  x  3  rotation  matrices.  Additional 
background  for  the  T-convergence  argument  is  given  in  Section  8.4.1. 

2  Experimental  methods 

The  main  experimental  results  reported  in  this  paper  are  measurements  of  hysteresis  vs.  lattice 
parameters  in  a  family  of  TiNiX  alloys,  X  =  Cu,  Pd,  Pt,  Au,  in  which  the  composition  was  system¬ 
atically  tuned  to  make  the  middle  eigenvalue  of  the  transformation  strain  matrix  equal  to  1.  For 
the  interpretation  of  these  results  in  terms  of  energy  barriers,  measurements  of  transformation  tem¬ 
perature,  latent  heat  and  density  are  needed,  and  these  are  also  reported.  The  experimental  results 
are  presented  in  Section  4.  In  this  section  we  briefly  summarize  the  experimental  methods  that 
were  used.  More  detailed  information,  particularly  concerning  uncertainty  and  the  repeatability  of 
the  measurements,  can  be  found  in  the  thesis  of  Zhang  [50]. 

Alloys  of  TiNiX,  X  =  Cu,  Pd,  Pt,  Au,  were  arc-melted  on  a  water  cooled  copper  hearth  from  high 
purity  elemental  materials  under  argon  protection,  after  purging  the  chamber  several  times  under 
vacuum.  Ti  was  used  as  an  hydrogen- getter.  The  resulting  buttons  were  sliced  using  an  electrical 
discharge  machine  (EDM)  and  then  heat  treated  under  vacuum  in  quartz  ampoules,  followed  by 
a  water  quench.  For  the  various  compositions  the  annealing  temperature  was  700-850°C,  and 
quenching  water  temperature  was  0-40°  C.  A  careful  electrolytic  polishing  gave  samples  for  both  the 
x-ray  measurements  and  differential  scanning  calorimetric  (DSC)  measurements.  The  electrolyte 
was  85-90%  glacial  acetic  acid  (CH3COOH)  and  10-15%  perchloric  acid  (HCLO4)  by  volume,  the 
cathode  was  stainless  steel,  the  anode  was  stainless  steel  or  Ti,  the  voltage  was  35-40V,  and  the 
temperature  of  the  bath  was  0°C. 

Lattice  parameters  of  the  alloys  were  measured  on  a  Scintag  x-ray  diffractometer  outfitted  with  a 
temperature  controlled  stage.  Special  attention  was  paid  to  alignment  by  using  an  internal  standard 
(NIST  standard  reference  640c)  and  periodically  doing  in-situ  alignment  at  different  temperatures. 
The  eigenvalues  Ai,  A2,  A3  were  calculated  directly  from  the  lattice  parameters  and  formulas  given 
below.  The  density  of  the  alloys  was  calculated  from  atomic  composition  of  the  starting  materials 
and  measured  unit  cell  volume. 

The  DSC  measurements  were  conducted  on  the  TA  Instruments  Q1000  according  to  the  ASTM 
standard  F2004-03.  The  DSC  samples  were  thinned  to  100  pm,  and  electrolytically  polished  by  the 
method  described  for  the  x-ray  specimens. 

Both  the  x-ray  diffraction  and  calorimetry  methods  were  used  to  determine  the  transformation 
temperatures.  Austenite  is  stable  at  high  temperature  and  martensite  is  stable  at  low  temperature. 
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At  some  temperature  9C  in  between,  the  two  phases  have  the  same  bulk  free  energy.  Because  of 
the  presence  of  hysteresis,  6C  is  difficult  to  measure  directly.  When  austenite  is  cooled  down  to  a 
certain  temperature,  it  begins  to  transform  to  martensite.  This  temperature  is  the  martensite  start 
temperature  Ms.  With  further  cooling,  the  transformation  is  completed  at  the  martensite  finish 
temperature  M/.  When  martensite  is  heated,  it  transforms  back  to  austenite.  The  austenite  start 
temperature  and  austenite  finish  temperature  are  denoted  by  As  and  Af .  respectively.  To  get  these 
four  temperatures  from  the  DSC  measurements  we  used  the  standard  procedure  of  constructing 
intersections  of  approximate  asymptotic  fines.  Operationally,  we  defined  0C  as  the  average  of  the 
four  characteristic  temperatures,  i.e.,  (As+A^+Ms+M^)/4.  In  the  x-ray  measurements,  the  trans¬ 
formation  temperature  9C  was  defined  as  the  intensity  of  austenite  peak  at  half  of  its  maximum 
value.  The  hysteresis  was  defined  as  (As+A/)-(Ms+M/). 

In  the  x-ray  method  we  determined  the  temperatures  M.s,  M/,  As,  Af  using  the  following 
procedure.  The  29  angle  was  first  confined  to  the  interval  39°  to  46°  because  the  (110)  peak  of  B2 
phase,  the  (111)  peak  of  B19  phase,  and  the  (Ill),  (111)  peaks  of  the  B19'  phase  of  the  TiNiX 
alloys,  which  are  the  strongest  among  all  peaks  of  the  corresponding  phases,  are  within  this  range. 
We  then  scanned  the  sample  at  high  temperature  to  obtain  a  pure  austenite  pattern.  Using  the 
same  method,  we  scanned  the  sample  at  low  temperature  to  get  a  pure  martensite  pattern.  Then 
we  gradually  decreased  the  scan  interval  of  temperature.  Because  x-ray  patterns  are  sensitive  to 
a  change  of  lattice  structures,  they  can  detect  a  tiny  phase  change  which  would  be  difficult  to 
quantify  by  other  methods.  To  have  a  definite  criterion  for  “start”  and  “finish”  we  established  a 
convention.  We  let  the  pure  austenite  pattern  at  high  temperature  and  the  pure  martensite  pattern 
at  low  temperature  be  standards.  Then,  the  temperatures  at  which  the  corresponding  major  peaks 
decreased  2%  from  those  of  the  standard  patterns  were  taken  to  be  As  or  M,s,  respectively.  Af  and 
Mf  were  defined  by  a  similar  criterion  using  98%  of  the  standard  peaks. 

The  latent  heat  L  was  defined  as  the  area  within  the  triangle  with  base  (A/  -  As)/2  and 
height  equal  to  the  maximum  height  of  the  DSC  peak,  divided  by  heating/cooling  speed,  which  was 
typically  10°C/s.  These  were  measured  on  the  heating  part  of  the  cycle  because  the  cooling  part 
exhibited  a  longer  tail  that  made  the  definition  of  M/  somewhat  difficult. 


3  Theoretical  methods 

The  variable  A2  will  denote  the  middle  eigenvalue  of  the  transformation  strain  matrix.  For  example, 
in  the  case  of  a  cubic  to  orthorhombic  phase  transformation,  the  six  linear  transformations  that 
map  the  cubic  structure  to  the  six  orthorhombic  variants  of  martensite  are  given  by  (15)  in  the 
cubic  basis.  Their  eigenvalues  are  assumed  to  be  ordered,  Ai  <  A2  <  A3.  The  alloys  of  interest  in 
this  paper  have  A2  near  1. 

The  theoretical  part  of  this  paper  relies  on  a  technique  called  T-convergence,  due  in  its  abstract 
form  to  De  Giorgi  [9].  This  method  allows  one  to  pass  from  a  variational  principle  depending 
on  a  small  parameter  to  a  limiting  variational  principle.  In  the  present  case  the  small  parameter 
is1  |  A2  —  1 1 ,  and  the  variational  principle  is  the  minimization  of  the  free  energy  stored  near  the 
austenite/martensite  interface.  As  is  typical,  the  limiting  variational  principle  is  not  obtained  by 
putting  A2  =  1  (or  expanding  the  energy  density  in  a  Taylor  series  about  A2  =  1).  While  one  could 

1Or,  equivalently,  the  small  parameter  can  be  chosen  as  A,  the  volume  fraction  appearing  in  the  crystallographic 
theory  of  martensite,  this  being  related  to  A2  —  1  by  (34). 
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imagine  finding  the  minimizers  of  the  original  variational  principle,  calculating  their  asymptotic 
form  as  A2  approaches  1,  and  then  seeking  a  variational  principle  for  these  asymptotic  limits,  the 
method  of  T- convergence  gives  the  limit  directly,  without  the  intermediate  step  of  calculating  the 
minimizers  of  the  original  variational  principle.  The  method  involves  the  careful  calculation  of 
upper  and  lower  bounds  on  low  energy  deformations  as  A2  tends  to  1.  It  follows  from  the  method 
that  the  limit,  in  a  well-defined  sense,  of  a  sequence  of  minimizers  (parameterized  by  A2)  of  the 
original  variational  principle  is  a  minimizer  of  the  limit  energy,  and  their  energies  also  converge  to 
the  minimum  of  the  limiting  energy. 

As  is  typical  with  T-convergence,  the  limiting  variational  principle  is  simpler  than  the  original 
one  and  has  a  universal  status.  In  the  present  case  this  is  realized  by  the  fact  that  the  limiting  energy 
only  depends  on  knowledge  of  the  linearized  elastic  moduli  of  austenite  and  martensite,  whereas  the 
original  principle  depends  on  their  nonlinear  elastic  properties.  Once  these  moduli  are  measured 
for  A2  ~  1  alloys,  and  the  interfacial  energy  on  the  twins  are  measured,  the  kinds  of  scaling  laws 
discussed  by  Schryvers  [38]  can  be  investigated  on  a  quantitative  basis.  Recent  advances  on  the 
measurement  of  interfacial  energies  (see  [37],  [42],  [45])  make  this  a  realistic  possibility. 


4  Measurements  of  hysteresis,  transformation  temperature 
and  latent  heat  in  alloys  whose  lattice  parameters  have 
been  tuned  to  make  A2  =  1 

For  this  study  we  began  with  the  Ti5oNi,vj_xCu;),  system,  relying  on  a  paper  of  Moberly  and  Melton 
[30]  that  showed  the  presence  of  twin-free  habit  planes  in  this  alloy  at  x  —  10.  A  closer  examination 
of  the  original  thesis  of  Moberly  [29]  revealed  that  these  twin-free  interfaces  were  rather  rare:  the 
photographs  in  the  original  thesis  mainly  showed  twinned  interfaces  in  the  same  alloy.  In  hindsight, 
we  believe  that  the  pictures  chosen  for  publication  were  found  are  places  in  the  specimen  where,  due 
to  the  presence  of  stress,  the  lattice  parameters  were  perturbed  from  the  stress-free,  equilibrium 
values.  (In  our  subsequent  x-ray  measurements  this  could  be  understood  as  arising  from  lattice 
parameters  corresponding  to  points  near  the  tails  of  the  x-ray  peaks  in  this  system.)  As  is  known, 
at  about  x  =  10  at.  %  Cu  this  alloy  loses  the  monoclinic  martensite  in  favor  of  an  orthorhombic 
martensite.  We  prepared  a  matrix  of  alloys  of  Ti  =  48,  49,  50,  51,  52  at.%,  Cu  =  0,  5,  10,  15, 
20,  25,  35  at.%  with  Ni  =  100  -  Ti  -  Cu  and  measured  lattice  parameters.  The  variation  of  A2  vs. 
composition  for  these  alloys  is  shown  in  Figure  1. 

For  the  case  Ti  =  50  at.%  we  also  prepared  alloys  with  Cu  =  25,  30,  35  but  these  contained 
excessive  TiCu  precipitates  and  are  not  discussed  here. 

This  data  motivated  the  wider  combinatorial  study  [8]  of  this  system.  Taken  together,  and 
accounting  for  the  slightly  decreased  accuracy  of  the  combinatorial  measurements,  we  did  not  find 
convincing  evidence  that  A2  =  1  was  achieved  at  any  composition  the  TiNiCu  system,  although,  as 
seen  in  Figure  1,  Ti49Ni3iCu2o  comes  quite  close.  Thus,  we  shifted  further  study  to  other  alloys. 

The  Hume-Rothery  rules  for  atomic  size  and  valence  indicated  that  Rh,  Ir,  Pd,  Pt,  Ag,  and 
Au,  all  substituting  for  Ni,  would  be  good  candidates  for  stabilization  of  the  orthorhombic  phase. 
Extensive  work  on  the  alloys  Ti5oNi5o-xAgx  revealed  a  maximum  soluability  of  x  =  2  at.  %  Ag 
which  we  were  not  able  to  overcome  by  heat  treatment,  and  therefore  did  not  yield  any  useful 
alloys.  We  note  that  A2  <  1  is  satisfied  by  all  the  TiNiCu  alloys  mentioned  above,  and  also  by 
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Figure  1:  Measurements  of  A2  vs.  composition  in  the  TiNiCu  system,  Ti  =  48,  49,  50,  51,  52  at.%, 
Cu  =  0,  5,  10,  15,  20,  25,  35  at.%  with  Ni  =  100  -  Ti  -  Cu. 


TiNi  itself.  Thus,  the  identification  of  starting  alloys  with  A2  >  1  was  critical  for  an  interpolation 
scheme.  We  noticed  from  work  of  Donkersloot  and  van  Vucht  [11]  that  measured  lattice  parameters 
at  the  martensitic  transformation  in  TiPd  have  A2  >  1,  and  our  subsequent  measurement  of  lattice 
parameters  in  Ti5oNi34Pdi6  showed  A2  >  1.  With  regard  to  the  basic  hypothesis  of  a  connection 
between  hysteresis  and  A2  =  1,  we  were  also  motivated  by  the  data  collected  in  US  Patent  5,951,793 
which  did  not  report  lattice  parameters  but  indicated  a  sharp  drop  in  hysteresis  for  Ti49  5Ni40.5Pdio. 
The  review  article  of  Miyazaki  and  Ishida  [28]  also  indicated  a  sharp  drop  of  the  hysteresis  near  10 
at.  %  Pd  in  the  data  they  report  on  films,  although  the  interpretation  suggested  by  the  authors  was 
of  a  continuous  decrease  of  hysteresis  with  increasing  Pd.  Referring  to  their  data,  Miyazaki  and 
Ishida  state  that  this  is  a  surprising  result. 

Having  found  that  Ti50Ni34Pdi6  has  A2  =  1.005  >  1  we  did  a  straightforward  interpolation.  A 
similar  procedure  was  followed  with  TisoNiso-^Xa,,  X  =  Pt,  Au:  first  locate  one  alloy  with  A2  >  1, 
then  interpolate  backward.  The  data  for  hysteresis  vs.  composition  for  X  =  Pd,  Pt,  Au  are  shown 
in  Figure  2.  Note  the  drop  of  hysteresis  of  a  factor  between  5  and  10.  Although  the  minimum 
hysteresis  in  all  cases  occurs  at  different  values  of  x,  in  each  case  the  minimum  occurs  very  near  the 
composition  where  A2  =  1. 
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Figure  2:  Width  of  the  hysteresis  vs.  x  for  alloys  Ti5oNi5o-a;Xx,  X  =  Pd,  Pt,  Au. 


At  each  data  point  of  Figure  2  we  also  measured  lattice  parameters  of  the  two  phases  at  the 
same  temperature.  From  this  we  calculated  A2  corresponding  to  each  point  in  Figure  2.  It  becomes 
more  interesting  to  eliminate  x  and  plot  hysteresis  directly  vs.  A2.  This  is  done  in  Figure  3.  The 
combinatorial  data  from  [8]  on  TiNiCu  is  also  included  in  this  plot.  One  can  see  a  remarkable  collapse 
of  the  data  onto  two  lines  shaped  like  a  V.  The  suggestion  of  this  plot  is  of  a  universal  behavior. 
However,  one  should  note  that,  while  latent  heats,  elastic  moduli  and  the  other  lattice  parameters 
vary  somewhat  among  these  alloys,  the  crystallography  is  in  all  cases  (except  the  TiNiHf  data,  see 
below)  cubic  to  orthorhombic.  Thus,  one  can  conjecture  that,  within  a  certain  crystallographic 
change,  the  main  parameter  that  controls  hysteresis  is  A2  and  the  behavior  is  universal.  The  theory 
presented  later  in  this  paper  supports  this  viewpoint.  We  have  also  included  on  Figure  3  data  on 
the  system  TiNiHf,  which  undergoes  a  cubic  to  monoclinic  (B2  to  B19’)  as  in  TiNi.  The  line  is 
drawn  using  data  of  Matveeva  et  al.  [27]  and  Potapov  et  al.  [35]  and  was  confirmed  by  our  own 
measurements  on  this  system.  Alloys  in  the  family  TiNiHf,  once  considered  promising  candidates 
as  high  temperature  shape  memory  alloys,  show  excessive  hysteresis  at  concentrations  of  Hf  that 
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Figure  3:  Width  of  the  hysteresis  vs.  A2 


raise  the  transformation  temperature  significantly.  This  can  be  understood  from  the  following 
simple  observation.  Binary  TiNi  has  A2  <  1.  Increase  of  the  percentage  of  Hf  (substituted  for  Ti) 
decreases  the  value  of  A2  making  it  further  from  1,  significantly  raising  the  hysteresis  as  indicated 
in  Figure  3. 

We  note  that  we  have  previously  reported  [17]  an  extension  of  alloys  satisfying  A2  =  1  to  the 
quaternary  system  TiNiCuPd. 

For  our  later  purpose  of  evaluating  certain  quantities  in  the  theory,  we  will  need  the  detailed 
data  on  lattice  parameters  (i.e.,  also  Ai  and  A3)  and  latent  heat.  This  data  is  reported  in  Table  1. 
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Table  1:  Eigenvalues  of  the  transformation  strain  matrix,  latent  heat  and  density  of  TiNiPd  alloys 


alloy 

0c(°C) 

Ai 

^2 

^3 

latent  heat  (J/g) 

density  (g/cm3) 

TiNi 

80 

0.36 

6.44 

TiNi43P  d7 

-20 

0.9398 

0.9970 

1.0606 

0.30 

6.75 

TiNi4iP  dg 

40 

0.9341 

0.9988 

1.0635 

0.14 

6.76 

TiNi3gPdn 

25 

0.9280 

1.0001 

1.0674 

0.23 

6.85 

TiN^Pdie 

20 

0.9280 

1.0050 

1.0633 

0.38 

7.11 

TiNi32Pdi8 

113 

0.9227 

1.0050 

1.0710 

0.58 

7.20 

TiNi28Pd2o 

103 

0.9244 

1.0060 

1.0691 

0.56 

7.35 

TiNi2sP  d22 

155 

0.9186 

1.0060 

1.0753 

0.70 

7.40 

TiNi25Pd25 

189 

0.9167 

1.0070 

1.0775 

0.60 

7.54 

5  Background  on  the  crystallographic  theory  and  special 
relations  among  lattice  parameters 

We  consider  a  material  transforming  from  austenite  to  martensite.  Figure  4  sets  the  notation.  We 
assume  |m|  =  1,  |n|  =  1  and  of  course  m  not  parallel  to  n.  The  vectors  n-1  and  m3-  he  in  the 
(m,  n)  plane  and  are  uniquely  determined  by  ir1  •  n  =  0,  m3-  •  m  =  0,  |n-L|  =  |m-L|  =  1,  m3-  •  n  >  0, 
n3-  •  m  =  m3-  •  n.  (This  normalization  is  not  the  standard  one,  but  it  is  more  convenient  here.) 


Figure  4:  Austenite/martensite  interface  with  a  simple  transition  layer  (C). 

We  assume  a  basic  framework  for  martensitic  phase  transformations.  That  is,  we  consider  a  free 
energy  density  <p(F.  6)  as  a  function  of  deformation  gradient  and  temperature  defined  on  M3x3  x  R-. 
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We  hold  6  fixed,  below  transformation  temperature,  and  drop  it  from  the  notation  until  later  in 
this  paper.  At  this  temperature  <p  is  assumed  to  have  a  local  minimum  at  I  and  a  lower  global 
minimum  on  the  martensite  energy  wells,  SO(3)Ui, . . SO(3)U/v.  We  assume  the  distinct  matrices 
Ui, . . . ,  U\y  are  positive-definite,  symmetric,  and  symmetry  related:  there  exists  R2, . . . ,  Rat  G 
SO(3)  such  that  U,  =  R;UiRf,i  =  2, . . .  ,  N.  We  also  assume  that  that  the  wells  SO(3)Ui  and 
SO(3)U2  are  rank-1  connected,  i.e.,  there  are  a,  n  G  R3,  |n|  =  1,  and  R  G  SO(3)  such  that 

RU2  —  Ui  =  a®n.  (3) 

To  do  the  asymptotic  analysis  we  stay  away  from  the  singular  situation  A2  =  1  by  assuming  that 
det(Uf  —  I)  ^  0.  The  matrices  A  =  RUi  and  B  =  RRU2,  R  G  SO(3),  are  assumed  to  satisfy  the 
equations  of  the  crystallographic  theory  of  martensite, 

(AB  +  (1  —  A)  A)  —  I  =  b  ®  m,  (4) 

for  suitable  0  <  A  <  1,  b,  m  G  R3,  |m|  =  1  and  R  G  SO(3).  Given  the  assumptions  above,  necessary 
and  sufficient  conditions  that  there  is  a  solution  R  G  SO (3),  b,m  G  R3  and  0  <  A  <  1  of  (4)  are 
that  (see  [3]) 

I  ^  1 2  I  |2 

8  <  —2,  and  trU?  —  det  U?  +  — — >  2, 

2o 

where  8  =  a  •  Ui(U^  —  I)_1n.  (5) 

When  these  conditions  are  satisfied  there  are  two  solutions  for  A  given  by  A  =  A*  and  A  =  1  —  A* 
where: 

y  =  l  +  \Jf^-  (6) 

Remark.  (Brief  summary  of  the  crystallographic  theory;  see  [47],  [25],  [7],  and  [3]  for  the  version 
given  here.)  After  substituting  for  B  using  (3),  the  equation  (AB  +  (1  —  A) A)  —  I  =  b  ®  m 
becomes  R(Ui  +  Aa  0  n)  =  I  +  b  ®  m,  which  has  a  solution  if  and  only  if  the  middle  eigenvalue  of 
Ga  =  (Ui  +  An  0  a)(Ui  +  Aa  ®  n)  is  1.  This  matrix  has  an  eigenvalue  equal  to  1  if  and  only  if 

g( A)  =  det[(Ui  +  An  ®  a)(Ui  +  Aa  ®  n)  —  I]  =0.  (7) 

Although  it  appears  to  be  a  sixth  order  polynomial,  g( A)  is  actually  quadratic  in  A  and  symmetric 
about  1/2,  so  it  can  be  written 

9(A)  =  CO  -  ;[)2  +  »■  (8) 

The  simplest  evaluation  of  (  and  r/  follows  from  the  equations 

\(  +  V  =  det(GA-I)U=0  =  det(U?-I) 

-C  =  ^det(GA-I)U=o  =  2(det(U2-I))(a-U1(U?-I)-1n)  (9) 

The  condition  that  g( A)  has  a  root  on  (0,1)  is  that  g(0)g(l/2)  <  0  which  gives  the  first  of  (5). 
Suppose  this  holds  and  call  the  roots  A*,l  —  A*.  A  necessary  and  sufficient  condition  that  these 
roots  are  associated  to  the  middle  eigenvalue  is  that 

0  A  (Ai  —  1)(1  —  A3)  =  Ai  +  A3  —  1  —  A1A3  =  trGA*  —  2  —  det  Ga*,  (10) 
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which  gives  the  second  of  (5).  We  note  that  the  cofactor  conditions  discussed  in  [17]  are  the 
conditions  that  i)  the  middle  eigenvalue  of  Ui  is  1  and  ii)  g( A)  is  identically  0,  i.e.,  that  both  £  and 
i]  vanish.  In  this  case  det(U2  —  I)  =  0,  and  the  second  equation  in  (9)  remains  valid  in  the  modified 
form  £  =  —2a  •  UiCof(Uf  —  I)n)  =  0. 

We  let  a  =  Ra  with  R  obtained  as  in  this  remark,  that  is.  B  A  =  a  0  n.  A  simple  transition 
layer  is  the  one  shown  in  Figure  4.  It  is  kinematically  possible  because,  using  (5),  there  is  a  matrix 
C  such  that 

C  —  I  =  f  ®  m,  Cv  =  Av,  Cw  =  Bw.  (11) 

(see  Figure  4).  To  verify  this,  note  first  that 

v  =  — - — -  ((1  —  Ajenr1  —  an-1)  ,  w  =  — - — -  (Aem-1  +  an-1)  (12) 

n  •  m-1-  v  '  n  •  m-1-  v  ' 

for  some  a  >  0,  which  is  the  height  of  the  triangle  as  shown  in  Figure  4  as  one  can  see  by  computing 

— v  •  m. 

Now  it  can  be  seen  that  the  following  expression  for  C  satisfies  (11): 

C  =  I  +  f®m,  f  =  b  +  —  A(1  —  A)a.  (13) 

a 

The  matrix  C  is  of  course  not  on  the  energy  wells. 


6  Energy  barrier  of  a  fully  developed  austenite/martensite 
interface 


Consider  the  case  of  cooling  from  austenite  to  form  martensite.  We  picture  a  lenticular  nucleus, 
stabilized  by  a  defect  (e.g.,  precipitate,  triple  junction  on  a  grain  boundary),  and  we  imagine  that 
this  nucleus  is  already  present  at  temperatures  above  Ms.  Our  main  hypothesis  is  that  this  nucleus  is 
already  twinned,  so  its  boundary  consists  of  two  fully  developed,  slightly  bowed  austenite/martensite 
interfaces.  We  ignore  the  ends  of  the  nucleus,  and  picture  simply  a  twinned  laminate  of  martensite 
of  length  £  (i.e.,  the  half- width  of  the  nucleus)  meeting  austenite.  As  sources  of  energy,  we  consider 
the  interfacial  energy  on  the  twin  boundaries  modeled  by  a  sharp  interface  theory,  the  bulk  elastic 
energy  stored  near  the  austenite/martensite  interface,  and  the  bulk  free  energies  of  the  two  phases. 
We  analyze  the  energy  as  a  function  of  £  accounting  for  the  possibility  that  the  twin  density  and 
elastic  transition  layer  may  relax  as  £  is  changed.  In  this  section  we  consider  a  simple  model  for  the 
transition  layer  that  is  improved  later  in  this  paper. 

We  consider  the  case  in  which  the  middle  eigenvalue  A2  of  the  matrix  Ui  is  close  to  1  and,  for 
simplicity,  we  fix  the  other  eigenvalues2.  In  a  basis  for  which  Ui  is  diagonal,  we  then  evaluate  the 
formula  (5)  for  5: 


S  =  (aini) 


Ai 


A? 


+  (CI2A2)  To - 7  +  (a3^3)- 


A| 


1 


A| 


(14) 


2The  other  eigenvalues  Ai,A3  could  also  be  allowed  to  vary  as  long  as  they  are  bounded  away  from  1;  this  of 
course  forbids  the  cubic  to  tetragonal  case,  since  in  that  case  A2  =  1  and  Ai  <  A2  <  A3  imply  that  Ai  =  1  or  A3  =  1. 
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This  is  dominated  by  the  middle  term  and  we  note  that,  typically,  as  A2  approaches  1,  the  quantity 
a2n2  does  not  tend  to  zero3.  In  view  of  the  first  condition  of  (5)  we  have  the  following:  assuming 
that  cl2ti2  |  a2=i  7^  0  then  for  a  given  twin  system  the  crystallographic  theory  has  solutions 
for  at  most  one  of  the  regions  A2  >  1  or  A2  <  1.  (See  also  Items  1  and  2  below.) 

We  now  give  a  more  detailed  study  of  this  in  the  cubic-to-orthorhombic  case,  which  applies  to 
all  the  NiTiX  alloys  described  above.  In  this  case  the  six  transformation  strain  matrices  are, 
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Written  this  way  Ai,  A2,  A3  are  the  eigenvalues  of  each  of  these  matrices,  and,  for  the  application 
to  the  TiNiX  alloys  studied  here,  we  can  without  loss  of  generality  assume  these  are  ordered, 
Ai  <  X2  <  A3.  We  consider  twin  systems  that  consist  of  variant  1  twinned  with  all  other  variants; 
it  is  sufficient  to  consider  these  by  symmetry.  Variant  1  is  compound  twinned  with  variant  2,  but  it 
forms  Type  I/Type  II  twins  with  the  other  four  variants.  (Thus,  all  pairs  of  variants  are  twinned). 
By  direct  calculation,  in  the  notation  of  (14),  we  have  the  expressions  for  a2n2  listed  in  Table  2  at 

a2  =  1. 


Table  2:  Values  of  a2n2  for  various  twin  systems  at  A2  =  1.  Cubic-to-orthorhombic  case. 


twins  between  variants  1  and  2 

twins  between  variants  1  and  3,  4,  5,  6 

value  of  a2n2 

A§— 1 

A?(3A§-  1)— 2A§ 

(Type  I  twins) 

A3+1 2 

2(2A|+Af  (1+A§)) 

value  of  a2n2 

A§  — 1 

A§+2A?-3 

(Type  II  twins) 

*3+1 

2(Ag+2A^+l) 

The  numerators  of  the  expressions  in  the  last  column  of  Table  2,  for  eigenvalues  near  1,  are  close 
to  A§  —  1  +  2(A)*  —  1)  which  is  typically  negative  for  experimental  cases  and  also  for  any  case  which 
is  nearly  volume  preserving.  Hence,  under  this  typical  situation  that  entries  in  the  last  column  of 
Table  2  are  negative,  we  reach  the  following  conclusions: 

1.  A2  <  1.  Only  compound  twins  form  austenite/martensite  interfaces.  Altogether,  there  are 
only  six  of  these  twin  systems,  two  each  for  variants  1-2,  3-4,  5-6. 

2.  A2  >  1.  All  of  the  Type  I  and  Type  II  twin  systems  can  form  austenite/martensite  interfaces. 

3  “Typically” ,  here  means  that  if  Ui  is  changed  to  pU  1  for  p  near  1,  the  middle  eigenvalue  of  pUi  can  be  made 
to  pass  through  1  but  a2n2  is  only  changed  to  pa2n2. 
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This  striking  conclusion  is  consistent  with  results  of  Hane  and  Shield  [14].  Experimental  values  of 
the  dominant  factor  a2n2  for  alloys  in  which  A2  =  1  are  given  in  Table  3.  Within  a  given  twin  type, 
these  numbers  do  not  vary  greatly  among  these  various  systems. 


Table  3:  Values  of  a2n2  for  alloys  of  TiNiX,  X  =  Cu,  Pt,  Pd,  Au,  at  compositions  where  A2  =  1 


twins  between  var.  1  and  2 

twins  between  var.  1  and  3,  4,  5,  6 

Ti5oNi26Cu2oP  d4 
(Type  I  twins) 

0.0534 

-0.0137 

TisoNi^Cu^P  d4 
(Type  II  twins) 

0.0534 

-0.0103 

Ti5oNi3gPdn 
(Type  I  twins) 

0.0651 

-0.024 

Ti5oNi3gPdn 
(Type  II  twins) 

0.0651 

-0.018 

Ti5oNi43Pt7 
(Type  I  twins) 

0.0645 

-0.0212 

Ti5oNi43Pt7 
(Type  II  twins) 

0.0645 

-0.0157 

Ti5oNi37Aui3 
(Type  I  twins) 

0.0606 

-0.018 

Ti5oNi37Aui3 
(Type  II  twins) 

0.0606 

-0.0133 

One  should  remark  that  even  though  the  crystallographic  theory  forbids  certain  twins  to  par¬ 
ticipate  in  the  austenite/martensite  interface,  all  the  types  of  twins  listed  above  are  possible  in  the 
martensite.  For  example,  in  cases  where  compound  twins  are  forbidden  by  the  crystallographic  the¬ 
ory  to  be  present  in  austenite/martensite  interfaces,  they  could  still  be  observed  in  the  martensite, 
produced  by  stresses  built  up  during  transformation,  especially  in  a  polycrystal. 

Assume  that  we  are  in  one  of  the  regions  where  the  crystallographic  theory  has  a  solution,  that 
is,  the  second  of  (5)  holds  and  either  A2  >  1  and  a2n2  <  0  or  A2  <  1  and  a2n2  >  0.  From  (6)  we 
then  have  that  in  the  regime  of  interest  in  this  paper  A*  ~  1.  We  confine  attention  to  the  solution 
A  =  1  —  A*.  Thus,  the  volume  fraction  A  ~  0,  i.e.,  it  is  the  volume  fraction  of  the  variant  that  is 
paired  with  variant  1  that  is  disappearing  as  A2  — >  1. 

Thus  the  martensite  region  contains  a  large  volume  fraction  where  the  deformation  gradient  is 
A.  Also,  from  equations  (4)  and  (13)  we  have 

A  ^  I  +  b  ®  m,  C  ^  I  +  b  ®  m  (16) 

as  A2  — »  1  with  e/a  fixed.  Since  C  is  near  A  in  this  limit,  it  is  natural  to  estimate  the  energy 

of  C  using  a  quadratic  approximation  of  the  free  energy  density  cp  near  A.  We  would  also  like 

to  optimize  over  the  choice  of  a  (see  Fig.  4),  but  we  keep  e  fixed  for  now.  Later,  after  we  have 

introduced  interfacial  energy,  we  will  optimize  over  the  choice  of  the  “fineness”  e. 

Note  that  C  =  I  +  f  ®  m  =  I  +  b  ®  m  +  (e/a)A(l  —  A)a  ®  m,  and  from  (4)  we  have  A  = 
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I  +  b  ®  m  —  Aa  ®  n.  Thus,  as  must  be  true,  rank(C  —  A)  =  1,  i.e., 


C  =  A  +  a 


An  H — A(1  —  A)m 
a 


(17) 


For  our  simple  estimate  of  the  energy  of  C  we  let  p,  be  a  typical  elastic  shear  modulus,  we  note 
that  the  area  of  each  triangle  containing  C  is 


ea 


2m  •  n-1 


(18) 


(Recall  that  by  definition  m  rU  >  0.)  At  this  point  it  is  convenient  to  fix  one  value  of  the  free 
energy  by  putting  <p(A,  9)  =  0,  so  the  free  energy  of  C  will  be  positive.  This  of  course  implies  that 
(p(I,  9)  >  0.  Suppose  the  width  (into  the  page)  of  the  martensite  plate  is  w.  Then,  we  can  estimate 
the  energy  of  C  by  using  a  linear  elasticity  theory4  obtained  by  linearizing  about  A.  Using  (18) 
and  letting  =  A_Tn  +  {e/a){l  —  A)A_Tm  (so  that  CA-1  =  I  +  Aa  ®  c.x),  we  have 
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The  triangle  height  a  is  contained  in  the  prefactor  as  well  as  in  the  expression  for  c\.  Minimization 
of  this  energy  over  a  >  0  gives 
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We  insert  this  value  of  a  from  now  on.  Then,  assuming  the  height  of  the  martensite  plate  is  h, 
giving  h/s  triangles,  we  then  have 


total  energy  of  the  triangles  =  ewhn\2(l  —  A)£, 
where  the  geometric  factor  f  is  given  by 
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(21) 
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The  geometric  factor  measures  in  a  rough  way,  i.e.,  for  a  simple  model  of  the  transition  layer, 
the  effect  that  different  twin  systems  have  on  the  elastic  energy  stored  in  the  austenite/martensite 

4 Given  that  the  crystallographic  theory  is  geometrically  exact,  one  could  worry  about  the  possible  role  of  finite 
rotations  in  this  calculation,  and  therefore  be  led  to  use  nonlinear  elasticity  instead  of  linear  elasticity  to  estimate  the 
energy  of  C.  The  T-convergence  argument  given  later  in  this  paper  justifies  the  use  of  linearized  elasticity  (linearized 
about  A)  in  this  context  for  A  near  1. 
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interface.  It  depends  only  on  the  data  that  goes  into  the  crystallographic  theory,  that  is,  the  energy 
wells  and  lattice  parameters.  It  is  interesting  to  look  at  values  of  the  geometric  factor  for  various 
twin  systems.  This  is  done  in  Table  4  for  the  alloy  TijoNiso-.rPd,.,  x  ~  11.  It  can  be  seen  that 
there  is  a  very  wide  variation  of  almost  two  orders  of  magnitude  of  this  number,  and  it  also  varies 
significantly  within  a  given  type  of  twin.  This  partly  motivated  the  more  careful  treatment  of  the 
transition  layer  given  later  in  this  paper.  It  may  possibly  explain  why  certain  twins  predominate  in 
certain  alloys.  It  also  suggests  that  within  a  certain  twin  type,  certain  pairs  of  variants  would  tend 
to  be  rather  strongly  preferred. 


Table  4:  Geometric  factor  £  for  various  twin  systems  for  Ti50Ni5o-xPdx,  x  ~  11.  Note  the  wide 
variation.  Components  in  the  cubic  basis. 


variants 

twin  type 

habit  plane 

£  x  103 

1  and  2 

compound 

(-0.5005,  0.5005,  -0.7064) 

8.07 

3.47 

1  and  3 

Type  I 
Type  II 

(-0.5005,  0.5005,  -0.7064) 

48.6 

19.8 

1  and  4 

Type  I 
Type  II 

(-0.5005,  0.5005,  -0.7064) 

14.7 

2.07 

1  and  5 

Type  I 
Type  II 

(-0.5005,  0.5005,  -0.7064) 

23.4 

162 

1  and  6 

Type  I 
Type  II 

(-0.5005,  0.5005,  -0.7064) 

3.64 

16.9 

1  and  2 

compound 

(0.5005,  -0.5005,  -0.7064) 

3.47 

8.07 

1  and  3 

Type  I 
Type  II 

(0.5005,  -0.5005,  -0.7064) 

23.4 

162 

1  and  4 

Type  I 
Type  II 

(0.5005,  -0.5005,  -0.7064) 

3.64 

16.9 

1  and  5 

Type  I 
Type  II 

(0.5005,  -0.5005,  -0.7064) 

48.6 

19.8 

1  and  6 

Type  I 
Type  II 

(0.5005,  -0.5005,  -0.7064) 

14.7 

2.07 

Now  we  repeat  the  classical  calculation  with  this  model  of  the  transition  layer,  except  we  have 
the  austenite  and  martensite  wells  at  different  heights.  We  introduce  an  interfacial  energy  per  unit 
area  k  on  the  twins  and  write  the  total  energy  as 

2kwM 

total  energy  = - |  £w/i/uA2(l  —  A)£  +  <p( A,  9)wh£  +  <p( I,  6)wh(L  —  £),  (23) 

where  L  is  the  total  length  of  the  strip,  including  martensite  and  austenite  (L  »  £).  We  minimize 
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this  expression  over  e  >  0  and  get 


/  2k£ 

£  -  y^i  -  \)c  (  } 

giving  an  inverse  relation  between  twin  spacing  s  and  volume  fraction  A  in  the  limit  as  A  — >  0  (i.e., 
as  A2  — >  1).  This  gives  the  total  minimum  energy 


2whX\/2KfjI(l  —  A)£  +  wh£^p( A,  9)  —  <p( I,  9 +  const.  (25) 

This  has  the  typical  form  of  an  energy  with  a  single  barrier  that  arises  in  a  nucleation  calculation. 
The  \ft  term  dominates  for  small  £  and  the  linear  term  dominates  for  large  £.  Recall  that  we  have 
assumed  that  0  =  <p( A,  9)  <  (p(I,  9 );  thus,  the  energy  goes  to  —00  as  £  — >  00.  Focusing  now  on  the 
dependence  of  (25)  on  we  see  that  the  energy  (25)  has  a  single  maximum  at 

2AV(1  -  A)£ 
max  (^(A,0)-^(I,0))2 

The  value  of  the  total  energy  at  £max  is 

energy  at  the  barrier  = 

Note  that  the  energy  at  the  barrier  and  £max  are  both  proportional  to  A2.  Thus,  as  A2  — >  1  both 
fmax  and  the  height  of  the  barrier  tend  to  0  at  the  same  rate. 

Now  we  can  make  a  simple  criterion  based  on  the  idea  that  typical  defects  in  the  material 
stabilize  fully  developed  austenite/martensite  interfaces  at  a  value  £  =  tc.  Then  such  interfaces  are 
expected  to  move  when  <p(  1, 9)  —  if  (A,  9)  is  increased  (by  decreasing  temperature)  to  a  point  where 
f-max  given  by  (26)  exceeds  £c.  This  gives  the  criterion  for  the  size  of  the  hysteresis, 


2X2whK/j,(l  —  A)£ 


(26) 


2AV(1  -  \)S  ( 

MA,e)-v(i,e))2 


To  evaluate  the  denominator  of  this  expression,  consider  a  Taylor  expansion  of  each  energy  well 
about  9  =  9C.  The  minimizers  A,  I  on  the  martensite  and  austenite  wells,  respectively,  depend 
weakly  on  temperature,  so  we  expand  the  notation  by  writing  A(0),  1(0);  these  are  still  chosen  to  lie 
on  the  wells  and  to  satisfy  the  equations  of  the  crystallographic  theory  as  given  above.  Expanding 
<p  at  each  well  about  9  =  9C  we  have 


¥>(A(0),#) 


+  (MMA22  +  .  A-(n)  (9  _ 

M  fc)  +  .  r(n)  (« -  (29) 


where  9*  is  a  suitable  intermediate  temperature  between  9  and  9C  (possibly  different  in  the  two  lines 
above).  Since  A(0*),I(0*)  are  local  minimizers  of  <p  at  9*,  then  the  second  terms  within  the  large 
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parentheses  vanish.  If  we  assume  that  the  entropy  density  (=  —dcp/dO)  is  constant  on  the  austenite 
and  martensite  wells,  respectively,  for  temperatures  in  the  interval  ( 9,9C ),  identify  the  quantity 


L  =  9r 


fy>(I(0c),0c)  dip{A  (9C),9C) 


(30) 


d0  d0 

as  the  latent  heat,  and  use  that  the  wells  are  at  the  same  height  at  9C,  we  get  by  subtraction  of  (29) 

6r-6 


<p{A,9)-<p{I ,9)  =  L- 


6r 


(31) 


We  identify  (1/2 )H  =  (9C  —  9)  as  the  half  width  of  the  hysteresis  when  (28)  holds.  This  gives  the 
criterion 


Ai 


'2/qu(l  -  A)f 


=  V>(  A,  «)-¥>(!  ,0)  =  L 


».-«  lrrrl 

a.  2HLo' 


(32) 


that  is,  the  width  of  the  hysteresis  is 


(33) 

To  compare  with  Figure  3,  the  dependence  of  H  on  A2  near  A2  =  1  is  of  interest.  This  arises 
from  the  dependence  of  A  on  A2  in  (33).  This  is  given  by  A  =  1  —  A*  with  A*  evaluated  using  (6) 
and  (14).  Its  dominant  term  as  A2  approaches  1  is 


A(A2) 
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2- 


-2|A| 
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+  1 
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The  slope  at  A2  =  1  is 


A'(A2) 


l/|a2n2|,  A2  —  1+, 
—  l/|a2n2|,  A2  =  1  —  • 


(34) 


(35) 


Putting  (35)  into  (33)  we  get  a  formula  for  the  asymptotic  size  of  the  hysteresis.  This  formula  gives 
a  qualitative  prediction  like  that  seen  in  the  experimental  data  (Figure  3):  two  lines  meeting  at  a 
sharp  corner.  Note  that  in  Figure  5  the  graph  is  not  symmetric  about  A2  =  1.  That  is  because  both 
a2n2  and  £  depend  on  the  twin  system  which,  as  noted  above,  is  different  on  the  different  sides  of 
A2  =  1. 

Note  that  in  Table  3  the  values  of  a2n2  at  A2  =  1  for  a  given  twin  system  do  not  vary  much  among 
the  alloys  Ti-Ni-(Pd,  Pt,  Au,  Cu-Pd).  Similarly,  the  geometric  factor  for  a  given  twin  system  also 
varies  only  slightly  among  these  alloys.  The  other  parameters  /j,  9C,  k,  L,  tc  that  enter  the  formula 
(33)  for  the  hysteresis  are  also  not  expected  to  vary  greatly  among  these  alloys.  Evidently,  this 
insensitivity  suggests  a  reason  for  the  universality  of  the  measured  graph  of  hysteresis  vs.  A2,  Figure 
3.  According  to  the  present  theory,  this  universality  is  not  expected  to  extend  widely  to  other  alloys: 
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AV<l-A)f 


Figure  5:  Plot  of  the  factor  in  the  formula  for  the  hysteresis  H  vs.  A2  that  depends  on  lattice 
parameters.  For  this  plot  the  other  eigenvalues  are  determined  by  Ai  =  0.93,  A1A2A3  =  1  and  the 
twin  system  is  assumed  to  be  the  one  with  the  lowest  geometric  factor  £  at  the  given  A2. 


all  of  the  parameters  listed  in  this  paragraph  can  be  highly  variable  among  different  martensitic 
materials. 

These  results  are  also  valid  away  from  A2  =  1.  Therefore  it  is  interesting  to  plot  the  factor 
A  \J ( 1  —  A)£  in  the  formula  for  the  hysteresis  if  vs.  A2.  This  is  the  factor  that  depends  explicitly 
on  lattice  parameters.  A  plot  of  A yj (1  —  A )£  vs.  A2  is  shown  in  Figure  5.  Comparing  Figure  5  with 
that  of  the  data  in  Figure  3,  one  sees  a  remarkable  similarity. 

We  cannot  give  more  quantitative  results  because  of  the  lack  of  knowledge  of  tc  and  of  the 
interfacial  energy  constant  k.  Regarding  the  latter,  among  all  shape  memory  alloys,  there  are  very 
few  measured  inter  facial  energies,  those  that  are  available  are  considered  highly  approximate  by 
most  workers.  Schryvers  [38]  studies  the  twin  branching  phenomenon  in  NiAl.  From  the  associated 
scaling  law  for  twin  spacing  vs.  distance  from  the  habit  plane  he  extracts  a  prefactor,  and  then,  by 
use  of  an  estimate  of  the  elastic  energy  of  the  transition  layer  (similar  to  that  done  here)  he  gets 
an  interfacial  energy,  roughly  estimated  at  1  mJ/m2.  For  Type  I  twins  in  CuAINi  Shilo  et  al.  [42] 
estimate  70  mJ/m2  by  fitting  measured  profiles  of  twin  boundaries  to  a  Landau  theory.  Waitz  et 
al.  [45]  extracts  atomic  positions  from  high  resolution  images  of  twin  boundaries  and  uses  these  to 
assign  atomic  positions  for  density  functional  theory  (DFT)  calculations,  arriving  at  a  value  of  f4 
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Table  5:  Material  constants  used  to  estimate  £c  for  TisoNiso-ajPda,,  x  =  11 


material  constant 

reference 

latent  heat 

L  =  0.231  J/g 

Table  1,  this  paper 

transformation  temperature 

6C  =  298  K 

Table  1,  this  paper 

elastic  modulus 

fi  =  27  GPa 

ref.  [33] 

density 

p  =  6.85  g/cm3 

Table  1,  this  paper 

lattice  parameters 

Ai  =  0.9280,  A2  =  1.0001,  A3  =  1.0674 

Table  1,  this  paper 

Interfacial  energy  density 
(compound  twins) 

ft  =  25  mJ/m2 

estimated,  see  text 

^,(-^2)|a2=i-  =  —  l/a2n2 

-15.36 

Table  3,  this  paper 

H'{ A2)  A2=i- 

-3000  K 

Figure  3,  this  paper 

mJ/m2  for  the  compound  twins  and  187  mJ/m2  for  the  Type  I  twins  in  monoclinic  TiNi.  It  should 
be  mentioned  in  this  context  that  minimum  energy  calculations  by  Huang  et  al.  [16]  using  several 
different  implementations  of  DFT  do  not  give  the  accepted  stable  phase  of  NiTi  at  0  =  0  (i.e.,  the 
one  seen  in  the  high  resolution  images  of  Waitz  et  ah  [45]).  O’Handley  et  ah  [32]  give  40  mJ/m2 
for  the  compound  twins  in  the  tetragonal  phase  of  an  alloy  of  Ni2MnGa. 

Despite  these  uncertainties,  it  is  useful  to  work  backwards  from  the  hysteresis  data  in  Figure  3 
to  a  value  of  the  critical  length  £c.  To  do  this,  we  use  twin  systems  with  the  smallest  geometric 
factor,  as  these  will  lose  stability  first  according  to  the  present  viewpoint,  and  we  consider  the  alloy 
TisoNiso-xPd^,  x  ~  11.  For  this  alloy,  by  direct  calculation  based  on  (22),  the  lowest  geometric 
factors  are  obtained  from  the  Type  II  twins  of  variants  1  and  4  (geometric  factor  2.07  x  10-3) 
and  the  compound  twins  of  variants  1  and  2  (geometric  factor  3.47  x  10-3),  and  their  symmetry 
related  twin  systems.  The  slopes  of  the  of  the  measured  hysteresis  data  can  then  be  used,  together 
with  other  assumed  or  measured  material  constants  in  Table  5,  to  give  a  value  of  £c  using  (33). 
The  values  we  obtain  by  that  procedure  are  ic  =  17.5/un  (compound)  and  ic  =  10.4/iin  (Type  II). 
Considering  the  uncertainty  in  the  interfacial  energy  and  other  constants,  these  numbers  could  be 
easily  off  by  factors  of  2  or  3.  However,  they  do  suggest  rather  large  values  of  £c  which  is  consistent 
with  the  viewpoint  adopted  in  this  paper  that  the  growth  of  fully  developed  austenite/martensite 
interfaces  is  responsible  for  determining  the  size  of  the  hysteresis. 

7  Remarks  and  comparisons  with  other  ideas  about  hys¬ 
teresis 

It  should  be  emphasized  that  the  simple  theory  given  above  gives  zero  hysteresis  when  A2  —  1, 
whereas  the  measured  thermal  hysteresis  decreased  to  between  5  and  15  C  in  the  alloys  studied. 
Possible  reasons  for  this  discrepancy  are  that  1)  the  alloys  studied  experimentally  did  not  sufficiently 
closely  satisfy  A2  =  1  or  were  not  sufficiently  homogeneous,  2)  the  theory  did  not  assign  an  energy  to 
the  exact  interface  between  austenite  and  martensite  in  the  case  A2  =  1,  and  3)  other  explanations 
besides  the  crossing  of  an  energy  barrier  associated  with  a  fully  developed  austenite/martensite 
interface,  such  as  thermal  activation  (but  see  below)  or  pinning  of  interfaces  by  defects. 
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In  the  above  we  have  attributed  the  crossing  of  the  barrier  to  the  presence  of  defects  in  the 
material,  but  a  competing  explanation  is  thermal  activation,  calculated,  for  example,  by  using 
transition  state  theory.  In  this  context  it  is  useful  to  compare  the  barrier  height  we  have  calculated  to 
k9.  For  this  purpose  we  need  also  values  of  other  dimensions  h,  w  of  the  martensite  plate.  In  almost 
all  observations  of  the  first  appearance  of  martensite,  the  plate  is  a  thin  plate,  h  3>  £c,  w  ic.  We 
then  obtain  a  highly  conservative  estimate  of  the  applicability  of  transition  state  theory  by  putting 
h  =  w  =  lc.  Using  the  smallest  value  ic  =  10.4  //,m  calculated  above,  we  get  a  barrier  height  of 
Eb  =  9.0  x  10-11  J,  while  at  6  =  6C  we  have  kd  =  4.4  x  10-21  J.  Thus  Eb/k0c  ~  2  x  1010  and  it 
is  seen  that  the  crossing  of  the  energy  barrier  we  have  calculated  here  by  a  thermal  fluctuation  is 
exceedingly  unlikely. 

The  latent  heat  L  enters  the  formula  (33)  for  the  hysteresis  in  the  denominator.  In  a  sense  6C/ L 
sets  the  scale  for  conversion  of  energy  to  temperature  difference.  All  else  the  same,  materials  with 
large  latent  heat  will  then  have  small  apparent  hysteresis.  This  is  opposite  to  conventional  wisdom, 
which  holds  that  large  latent  heat  implies  large  hysteresis  because  of  the  difficulty  of  getting  the  heat 
in  and  out  of  the  material  during  transformation.  The  latter  effect  has  been  studied  in  detail  by  Leo 
and  Shield  [23]  and  Shaw  and  Kyriakides  [41].  As  demonstrated  by  these  authors,  this  is  a  dynamic 
effect  that  has  a  negligible  effect  on  the  hysteresis  loop  with  sufficiently  slow  transformation.  Also, 
the  relative  changes  of  the  size  of  the  hysteresis  due  to  this  effect  are  rather  small.  In  any  case  the 
formula  (33)  for  H  is  not  intended  to  capture  this  dynamic  effect. 

Materials  with  weak  first  order  transformations  often  have  relatively  small  hysteresis.  In  such 
materials,  all  the  eigenvalues  of  the  transformation  strain  matrix  are  close  to  1,  and  in  materials 
with  small  volume  change,  AiA2A3  ~  1,  A2  is  particularly  close  to  1.  In  addition,  the  predicted 
asymptotic  slopes  of  H  vs.  A2  at  A2  =  1  tend  to  infinity  as  A i ,  A3  — »  1,  because  in  that  case 
a2n2  — » ►  0  (see  (35)).  Thus,  the  predicted  graph  of  hysteresis  H  vs.  A2  is  sharper  in  the  case  of  weak 
transformations,  implying  that  one  needs  to  approximate  A2  =  1  with  greater  precision  in  order  to 
achieve  minimal  hysteresis.  However,  k  and  L  are  also  expected  to  be  small  for  such  cases,  so  it  is 
not  clear  how  the  hysteresis  would  be  expected  to  behave  for  weak  first  order  transformations.  The 
asymptotic  behavior  of  k  and  L  in  cases  where  Aij2>3  ~  1  should  be  amenable  to  theoretical  analysis 
and  it  would  be  interesting  to  make  predictions  about  this  case. 

While  this  paper  has  concentrated  on  cubic  to  orthorhombic  NiTiX  alloys,  there  is  significant  ev¬ 
idence  in  the  literature  that  the  ideas  presented  here  apply  broadly.  Some  of  this  evidence,  including 
comparisons  with  Cu-based  martensitic  materials  undergoing  cubic  to  monoclinic  transformations, 
is  given  in  [17].  The  use  of  single  crystals  vs.  polycrystals  also  does  not  affect  greatly  the  hysteresis, 
also  in  accord  with  the  theory  given  here.  Qualitatively,  the  ideas  appear  to  extend  to  stress-induced 
transformation.  In  principle,  in  the  case  of  stress-induced  transformation,  one  should  have  to  ac¬ 
count  for  the  effect  of  stress  on  the  local  values  of  lattice  parameters.  The  presence  of  a  correlation 
between  A2  ~  1  and  the  size  of  the  hysteresis  in  stress-induced  cases  is  evidently  a  reflection  of 
the  smallness  of  the  perturbation  of  the  lattice  parameters  caused  by  the  typical  stresses  used  in 
such  experiments.  The  effect  of  orientation  on  hysteresis  seen  in  the  experiments  of  Horikawa  et 
al.  [15]  and  of  Shield  [40]  can  be  understood  qualitatively  [51]  by  using  the  measured  elastic  moduli 
to  estimate  the  effect  of  stress  on  lattice  parameters,  so  as  to  determine  whether  the  stress  moves 
the  A2  closer  to  or  further  from  the  value  1. 

One  puzzling  fact  about  the  experimental  results  concerns  the  numbers  of  strains  and  numbers 
of  variants.  One  would  expect  that  these  numbers  might  correlate  with  hysteresis,  as  large  numbers 
of  strains,  for  example,  would  seem  to  give  more  alternatives  for  accommodating  stresses  built  up 
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in  a  polycrystal  due  to  different  grain  orientations,  and  thereby  would  reduce  or  eliminate  some 
energy  barriers.  But  the  data  shown  in  Figure  3  is  fairly  symmetric  about  A2  =  1,  even  though 
the  number  of  twins  that  participate  in  the  austenite/martensite  interface  is  drastically  different 
for  A2  <  or  >  1,  as  indicated  in  Section  6.  The  actual  numbers  of  twin  systems  participating  are 
6  systems  for  A2  <  1  and  24  systems  for  A2  >  1.  There  are  4  austenite/martensite  interfaces  per 
twin  system  giving  rise  to  24  (resp.,  96)  interfaces  for  A2  <  1  (resp.,  >  1).  Exactly  at  A2  =  1, 
where  the  hysteresis  is  lowest,  these  degenerate  to  only  12  austenite/martensite  interfaces.  Thus, 
surprisingly,  there  appears  to  be  no  obvious  connection  of  the  numbers  of  strains  or  numbers  of 
interfaces  to  the  size  of  hysteresis.  This  indicates  again  that  the  main  energy  barrier  is  the  growth 
of  an  austenite/martensite  interface,  and  the  numbers  of  such  interfaces  is  of  secondary  importance 
for  the  size  of  the  hysteresis.  However,  these  numbers  could  well  play  an  important  role  for  other 
properties,  for  example  the  lifetime  of  the  material  under  repeated  transformation.  Interesting 
unpublished  data  of  Quandt  [36]  supports  this  view.  The  satisfaction  of  the  cofactor  conditions 
described  above  achieves  both  a  dramatically  larger  set  of  strains  and  interfaces,  as  well  as  A2  =  1. 

8  Theory  of  the  transition  layer  between  austenite  and 
martensite  when  A2  ~  1 

We  now  develop  ideas  for  a  transition  layer  that  can  provide  a  more  accurate  evaluation  of  the  bulk 
energy  of  the  transition  layer  than  the  one  given  above  by  simply  using  the  matrix  C. 

8.1  Kinematic  assumptions  and  boundary  conditions 

We  now  formulate  assumptions  for  a  more  general  but  still  simplified  transition  layer.  To  begin,  we 
look  at  common  features  of  the  deformation  gradients  that  enter  the  boundary  conditions  for  the 
transition  layer.  These  are  matrices  A,  B,  I  satisfying 

B  —  A  =  a  ®  n,  (AB  +  (1  —  A)  A)  —  I  =  b  ®  m.  (36) 

Let  e  be  a  unit  vector  perpendicular  to  both  m  and  n,  and  let  p  be  a  unit  vector  perpendicular  to 
both  a  and  b.  From  (36) 


Ae  =  Be  =  Ie  =  e  and  Arp  =  BTp  =  ITp  =  p. 


(37) 


Now  we  write  the  boundary  conditions  for  a  transition  layer.  On  the  left  (see  Figure  6)  we 
have  a  martensite  laminate.  Its  deformation  can  be  written  with  the  aid  of  the  following  periodic 
function  with  period  1: 


Xx(s) 


1  —  A,  0  <  s  <  A, 
-A,  A  <  s  <  1. 


(38) 


Note  that  xa  has  mean  0,  so  its  integral  is  also  1-periodic.  The  deformation  of  the  martensite  phase 
consistent  with  Figure  4  and  the  calculations  above  can  be  written 


y m(x)  =  (AB  +  (1  -  A)A)x  +  ea 


XA(s)  ds. 


(39) 
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By  direct  calculation 


VyM(x) 


B,  ei  <  x  •  n  <  e(i  +  A), 

A,  e(i  +  A)  <  x  •  n  <  e(i  +  1), 


(40) 


In  the  austenite  we  have  Vy^  =  I,  so  yn(x)  =  x  +  c.  In  principle,  the  presence  of  c  may  be  impor¬ 
tant;  there  should  be  a  stiffness  associated  with  changes  of  c.  However,  in  the  present  treatment 
we  shall  assume  (below)  that  the  deformation  gradient  in  the  B-layers  is  constant.  This  and  the 
continuity  of  the  deformation  at  0  will  then  give  a  formula  for  c. 

We  write 

y(x)  =  z(x)  +  x.  (41) 

We  note  from  the  expressions  for  y m  and  y a  that  Vy =  Vy =  e.  We  assume  that  this 
property  also  extends  to  the  transition  layer: 


Vze  =  0.  (42) 

We  can  introduce  coordinates 

x  =  Xi\iL  +  X2mL  +  x3e.  (43) 

Then  we  have  by  assumption  z(x\,X2)  Note  that  these  coordinates  are  non-orthogonal  (typically, 
m-1  •  ir1  0)  but  recall  we  have  assumed  m-1  •  m-1  =  ir1  •  ir1  =  1  and  nr1  •  n  =  m  •  ir1.  Note  also 
x  •  n  =  (n  •  m±)x2.  As  pictured  in  Figure  6,  the  austenite/martensite  interface  is  x  •  m  =  0,  that 
is,  Xi  =  0.  The  twin  boundaries  are  planes  :i;2  =  const. 
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Now  we  discuss  conditions  of  periodicity,  first  for  the  boundary  conditions.  We  note  that  both 
Ya(x)  —  x  and  y^(x)  —  x  are  periodic  in  the  x2  direction  with  period  £  =  e/{m.L  •  n).  That’s  clear 
for  yA  —  x.  For  ym  we  have,  using  the  second  of  (36), 


r\  (x+CmWn 

yM(x  +  Cm±)  -yjtf(x)  -Cm_L  =  (I  +  b0m)(Cm-L) +  ea  /  yA(s)  ds  -  (m-1, 


/•i(x-n)+l 

=  ea  x\(s)ds  =  0. 


(44) 


Consistent  with  this,  note  that  the  construction  of  the  transition  layer  using  C  also  had  the  property 
that  Vy  is  periodic  in  the  x2  direction  with  period  (  =  e/{m1-  •  n).  Extending  this  hypothesis  to 
more  general  transition  layers,  we  assume  that 


z(.r, .  x2  1-0 


z(xux2), 


C  = 


£ 

•  n 


(45) 


We  do  not  expect  the  twin  boundaries  necessarily  to  remain  planar  up  to  the  austenite/martensite 
interface  so  now  we  parameterize  these  boundaries.  We  assume  a  B-layer  is  confined  by  the  two 
surfaces  (see  Figure  6) 


=  f+(x i),  X2  =  f  (xi),  X\  <  0, 

f+  >  r,  /+(0)  =  r(0)  =  0.  (46) 

By  assuming  that  /+( 0)  =  /“( 0)  =  0  we  have  put  the  origin  in  its  natural  place  at  the  intersection 
of  the  pinched  needle  and  the  austenite/martensite  interface.  The  needle  is  assumed  to  be  pinched 
because  it  is  the  A-layer  that  is  nearly  compatible  with  austenite  when  X2  ~  1.  We  do  not  necessarily 
want  to  assume  the  needle  is  symmetric,  or  even  that  the  centerline  of  the  B-layer  (as  obtained 
from  the  boundary  conditions)  passes  through  the  origin.  Thus,  we  introduce  a  parameter  rj  and 
write  the  boundary  conditions  for  a  martensite  plate  of  length  £  as  the  following: 

y(x)  =  yM(x  —  r]£ n)  at  Xi  =  —£,  where 

/-(-£)  =  r/(  and  f+{-t )  =  (v  +  A)C,  (47) 

(Note  for  example  that  the  B-layer  r/e  <  x  •  n  <  (r)  +  X)e  corresponds  to  r/C,  <  x2  <  (r/  +  A)£.)  The 
parameter  r/  is  also  part  of  the  solution;  we  expect  it  will  emerge  that  //  <  0.  The  displacement  z 
then  satisfies  the  boundary  condition 

z(x)  =  yM(x  -  r]en)  -  x,  xi  =  -£,  (48) 

In  (47)  and  (48)  we  have  imposed  the  left  hand  boundary  condition  of  the  A-layer  at  xi  =  —£. 
Typically,  we  will  want  £  e  so  as  to  capture  the  full  transition  layer. 

For  simplicity  we  assume  constant  gradient  in  the  B-layers: 

z(x)  =  yiw(x  —  r}£  n)  —  x  =  (B  —  I)x  —  r/e  Bn  =  a?i(B  —  I^ir1  +  x2(B  —  I)m'L  —  r/eBn 

for  f~(x1)  <  x2  <  f+(x!).  (49) 
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This  defines  z  on  the  B-layer  attached  to  the  origin.  The  periodicity  condition  z(x\,x2  +  ()  = 
z(xi,  x2)  extends  this  function  to  the  other  B-layers.  For  the  A-layer  f+(x1)  <  x2  <  f~{x i)  +(  we 
therefore  have  the  boundary  conditions, 

z(xi,  f+(xx))  =  a:i(B  —  I)]!-1  + /+(xi)(B  —  I)!!!-1  —  ?7£Bn, 
z(x1,f~(x1)+C)  =  xi(B  —  I)]!1  +  f~(xi)(B  —  I)!!!-1  —  ^Bn,  (50) 

for  xi  <  0,  and,  on  this  layer  (f+(x i)  <  x2  <  f~(x i)  +  £)  we  expect  Vy  ~  SO(3)A.  Making  the 
ansatz  (49)  for  the  B-layers  now  gives  us  a  way  to  evaluate  the  constant  c  by  continuity  at  x  =  0: 

y(0)  =  z(0)  =  c  =  —r]£  Bn.  (51) 

At  the  sharp  interface  with  austenite  as  pictured  in  Figure  6  we  also  have  the  boundary  condition 

z(0,  x2)  =  (yu(x)  -  x)|Xl=0  =  -rjeBn,  0  <  x2  <  (,  (52) 

in  view  of  (51).  Finally,  we  have  the  conditions  at  X\  =  — oo  for  the  A-layer: 

z(—£,  x2 )  =  y m (x  —  r/en)  —  x  =  (A  —  I)x  +  eAa  —  ?7£An  at  x\  =  —£, 

=  xi(A  —  I)!!-1  +  x2(A  —  I)  nr1  +  eAa  —  rjeAn  (53) 

for  (//  +  X)(  <  x2  <  (?7  -|-  1)C-  A  summary  of  boundary  conditions  for  the  A-layer  and  the  conditions 
on  is  given  below. 

Boundary  conditions  on  the  A-layer 


f~(-£) 

z(xi,/+(xi)) 

z(xi,/“(xi)  +  C) 

z(0,  x2) 
z(-£,x2) 


V C  and  f+(-£)  =  (v  +  A)C,  C  =  — f — , 

•  n 

X\(B  —  1)^  +  f+(x i)(B  —  I)m_L  —  ?76:Bn,  X\  <  0, 
xi(B  —  1)^  +  f~(x i)(B  —  I)m_L  —  rje Bn,  x\  <  0, 
—r/eBn,  0  <  x2  < 

—£(A  —  1)^  +  x2(A  —  I)  nr1  +  eAa  —  r]£  An, 

(v  +  A)C  <x2  <  (r/  +  l)C 


(54) 


8.2  Change  of  variables 


We  now  make  a  change  of  variables  that  brings  out  the  structure  of  the  basic  mathematical  problem 
and  removes  the  solution-dependence  of  the  domain  of  integration  for  the  energy.  The  basic  small 
parameter  for  this  problem  can  be  taken  as  either  A  or  A2  —  1,  these  being  related  by  (34). 

For  the  independent  variables  we  change  variables, 


t  =r  t  =  X-1  -  IA?±1 

1  11  2  c-(/+Ui)-rfe))' 

The  Jacobian  of  (the  inverse  of)  this  transformation  is 


_  d(xux2) 

<90 M2)  ^  U 


/“)• 


(55) 


(56) 
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This  transformation  maps  the  domain  {— oo  <  x\  <  0,  f+(x i)  <  X2  <  f~(x i)  +  C}  to  the  fixed 
half- strip  {— oo  <  fi  <  0,  0  <  t2  <  1):  the  A- layer  becomes  a  rectangular  half-strip  in  these  new 
variables.  For  the  dependent  variables  we  make  the  change  to 


u(tr,  t2)  =  —  (z  (ti,  [C  -  (/+(ti)  -  f  (h))]t2  +  f+(t i))  -  ti(A  -  I)^  +  rjeBnj  .  (57) 

Recalling  that  y(x)  =  z(x)  +  x  and  x\  —  x  •  m/(nJ-  •  m),  x2  =  x  •  n/ (ir1  •  m),  (cf.  (43)),  we  denote 

t2f~  +  (1  -  t2)f+'  .  1 


c  -  (/+  -  /-) 


h  = 


C  -(/+-/-)’ 


(58) 


and  then  we  have, 


Vy  =  I 


n-1-  •  m 


(z,i  ®m  +  z,2  0n) 


—  I  +  ^u,i  + 

=  i  +  ^-_ 


n-1  •  m 


(A  —  I)!!-1  —  gu,2  )  ®  m  +  hu,2  ®n 


(A  —  I)!!1-  ®  m  +  (u,i  —  gu,2  )  ®  m  +  hu,2  ®n. 


•  m 


(59) 


From  the  equations  of  the  crystallographic  theory  we  have  A(B  —  A)  +  A  —  I  =  b®m,  implying 
that  (A  —  =  (m  •  nJ-)b.  Substituting  this  into  the  second  term  on  the  right  hand  side  of  (59), 

and  then  using  that  I  +  b  ®  m  =  A  +  Aa  Q  n  we  get  the  convenient  form 


Vy  =  A  +  (u,i  -gn,2 )  ®  m  +  (Aa  +  hn,2 )  0  n.  (60) 

Finally,  directly  from  (54),  we  have  the  boundary  conditions  for  the  function  u. 

Boundary  conditions  for  u(ti,t2) 


u(ti,0) 
u(fi,  1) 

u(0, t2) 

U 


vC  and  /+(-£)  =  (r]  +  A)C,  C=  — 

/+(ti)(l  -  A)a,  ti  <  0, 

/“(fi)(l  -  A)a,  ti  <  0, 

0,  0  <  t2  <  1, 

C  ((A  -t"  Tj)  —  A^)  (1  —  A)a,  0  <  t2  <  1. 


(61) 


8.3  Scaling  of  the  energy 

At  this  point  we  could  do  a  series  of  numerical  energy  minimizations  to  determine  the  energy  in 
the  transition  layer  as  a  function  of  the  parameters.  We  want  to  vary  at  least  the  small  parameter 
A,  the  elastic  modulus  ji  and  the  other  two  eigenvalues  Ai,  A3  in  the  transformation  strain  matrix. 
(The  effect  of  varying  the  fineness  e,  or,  equivalently,  the  periodicity  (,  is  determined  by  the  basic 
scale- invariance  of  elasticity).  We  may  also  wish  to  consider  more  general  elastic  energy  functions. 
Thus,  a  comprehensive  series  of  numerical  minimizations  like  this  could  quickly  become  unwieldy, 
and  it  is  not  clear  whether  a  simple  understanding  of  the  dependence  of  the  parameters  would  be 
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emerge.  This  is  a  good  situation  for  T-convergence,  as  this  method  gives  the  leading  order  energy, 
the  leading  order  minimizer,  and  simple  predictions  of  the  effect  of  changing  other  parameters. 

To  begin,  we  first  need  to  understand  the  basic  scaling  of  the  energy  with  A.  For  this  purpose 
the  earlier  test  function  involving  the  triangles  with  deformation  gradient  C  is  expected  to  give 
the  correct  scaling,  though  not  the  optimal  energy.  This  test  function  is  easily  expressed  in  the 
simplified  variables  of  the  preceding  subsection.  It  corresponds  to  the  choice 


*7  =  0,  /  (h)  =  0,  /+(ti) 


AC,  —  i  <  t\  <  a, 
^■ti,  a  <  ti  <  0, 

a  —  1  ’ 


(62) 


where  a  =  a.j  (nx  •  m),  with  a  <  0  having  the  same  meaning  as  in  Section  5.  The  function  u  is 
chosen  as  the  simple  continuous  function 


u(ti,t2) 


CA(1-*2)(1-A)a, 

CA(1  -  *2)(1  -  (A/h)fi)a, 
(CAti(l  -  A)/d)a, 


—£<  ti  <  a,  0  <  t2  <  1, 
a  <  ti  <  a(t2  -  1)/ (A t2  -  1),  0  <  t2  <  1, 
a(t2  —  1) / (A t2  —  1)  <  t\  <  0,  0  <  t2  <  1. 


(63) 


With  these  choices  one  can  see  that  all  the  boundary  conditions  (61)  for  u  are  satisfied,  and,  using 
the  expression  (60)  we  find  that  the  deformation  gradient  on  the  region  a(t2  —  1)/(A t2  —  1)  <  t\  <  0, 
is 

Vy  =  A  +  u,i  0m  +  Aa0n  =  A  +  a0  (l^il — — m  -|-  An)  =  C,  (64) 

a 

(cf.,  (17)),  and  of  course  outside  this  region  Vy  =  A.  This  is  precisely  the  test  function  used  earlier 
in  this  paper,  expressed  in  the  new  variables.  Therefore  we  can  rely  on  the  earlier  evaluation  (19) 
and  we  see  immediately  that  the  energy  goes  as  A2. 

Beginning  with  a  general  expression  for  the  energy,  we  therefore  divide  the  energy  by  A2  and 
seek  the  T  limit  of 

l 

0  (I  +  (u,i  -gnl2  )  0  A_Tm  +  (Aa  +  hu,2  )  0  A-Tn)  ((  -  (/+  -  /“))  dtidt2,  (65) 


where  (p  is  minimized  on  SO(3),  that  is,  0(F)  =  <y?(FA,  6)  where  <p  is  the  free  energy  used  earlier  in 
this  paper  and  the  dependence  on  6  is  suppressed. 

By  dividing  by  A2  we  are  assured  that  the  rescaled  energy  of  the  test  function  used  earlier 
in  this  paper  is  bounded  away  from  0  and  oo  as  A  — »  0  (cf.,  (21)).  But  of  course,  this  test 
function  is  not  a  minimizer  of  the  energy.  Note  also  that  the  integrand  of  (65)  is  minimized  when 
u,i  =  0,  u, 2  =  —  CA(1  —  A)a,  f~  =  r/C, ,  f+  =  (r/  +  A)C  but  of  course  this  satisfies  only  some  of  the 
boundary  conditions.  In  particular  it  does  not  satisfy  the  boundary  condition  at  t\  =  0. 


8.4  Calculation  of  the  T-limit  of  the  energy 

8.4.1  Hypotheses  and  summary  of  results  used  in  the  argument 

In  this  section  we  collect  the  hypotheses  used  for  the  derivation  of  the  T-limit  and,  for  the  con¬ 
venience  of  the  reader,  we  collect  the  various  results  and  definitions  used  in  the  argument.  The 
main  hypothesis  is  a  bound  on  the  slopes  of  the  functions  f±  that  has  the  effect  of  preventing  very 
thin  fingers  of  the  A-layer  from  penetrating  into  the  B-layer.  This  assumption  could  be  related  to 
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the  possibility  of  formation  of  nonclassical  austenite/martensite  interfaces  [1],  a  possibility  that  is 
therefore  being  ruled  out.  A  second  main  hypothesis  says  that  <p  grows  at  least  quadratically  as  we 
move  away  from  its  energy  well,  SO (3). 

Hypotheses  1)  We  assume  f  +  <  f~  +  (  and  f±  are  Lipschitz  functions  with  Lipschitz  constant 
c  >  0.  The  latter  means  that  |/±(s2)  —  /±(si)|  <  c|s2  —  Si|  for  all  —  £  <  si  <  s2  <  0.  It  also  implies 
that  the  region  Qg  =  {{xi,x2)  :  f+(x i)  <  x2  <  f~(x i)  +  (,  — £  <  x\  <  0}  (called  here  the  A-region) 
is  a  Lipschitz  domain,  and  therefore  the  rigidity  lemma  (see  below)  can  be  applied  on  VLg.  2)  The 
energy  density  <p  is  assumed  to  be  twice  differentiable  and  to  satisfy  <p(F)  >  C  dist2(F,  SO(3)).  3)  We 
assume  a  solution  A  =  A\  and  B  =  BA  and  b  =  bA,  m  =  m^,  •  ir1  =  1,  of  the  crystallographic 

theory  (4)  with  Aa  — >  A0  G  SO(3)Ui|a2=i  ^  I  as  A  — ►  0,  and  A\  —  BA  =  aA  0  n,  a0  ^  0. 

Summary  of  definitions  and  results  used  1)  The  notation  dist(F,  <S)  denotes  the  shortest 

Euclidean  distance  from  the  3x3  matrix  F  to  the  (closed,  bounded)  set  S.  In  the  case  rotation 

matrices,  S  =  SO(3)  =  {R  :  RTR  =  I,  detR  =  1},  and  assuming  detF  >  0,  an  alternative 

expression  is  dist(F,  SO(3))  =  |U  —  I|  where  U  is  the  right  stretch  tensor  of  elasticity  theory,  i.e., 

F  =  RU  is  the  polar  decomposition  of  F.  The  notations  L00,!/2,#1  denote  spaces  of  functions 

defined,  respectively,  as  the  set  of  functions  for  which  the  the  following  norms  are  finite:  for  L°°(d), 

ll/IU°°(n)  =  ess  supnl/l;  for  L2(0),  ||/|||2(f2)  =  /J/|2dx;  for  H\Q),  ||/||^i(n)  =  /J/|2dx  + 

fn  |V/|2dx.  Something  about  weak  convergence  here,  used  in  L2  and  H 1  so  far.  The 

interpolation  inequality  used  below  states  that  for  any  Lipschitz  function  /,  we  have  ||/||loo(r)  < 
2  1 

c||/II2=(r|  II/,|Iloo(r)-  Tlic  following  Rigidity  Lemma  [12]  will  also  be  used  in  the  argument:  for  each 
given  function  y  G  fWffi)  there  is  a  corresponding  rotation  matrix  R  G  SO  (3)  such  that 

f  |Vy(x)  —  R|2dx  <  C  f  dist2(Vy(x),  SO(3))  dx.  (66) 

Here  C  depends  only  on  0  G  M3,  which  is  assumed  to  be  a  Lipschitz  domain.  This  is  a  quantitative 
statement  of  the  fact  that,  if  a  deformation  gradient  is  at  each  x  G  0  near  some  rotation  matrix, 
which  a  priori  could  vary  from  point  to  point  in  0,  it  is  actually  near  a  single  rotation  matrix  on 
most  of  Q. 

Below,  the  “original  variables”  refers  to  deformations  expressed  as  a  function  of  X\,X2,X3  and 
satisfying  boundary  conditions  (54)  while  “new  variables”  refers  to  functions  u  (see  (57))  expressed 
as  a  function  of  ti,t2,t3  and  satisfying  the  equivalent  boundary  conditions  (61). 


8.4.2  Use  of  the  rigidity  lemma  to  restrict  the  form  of  a  minimizer 

Suppose  y\,  have  less  energy  than  the  naive  test  function  (63)  and  satisfy  the  boundary  conditions 
(54).  Then  yA,/f  satisfy 


dist2(VyA,  SO(3)Aa)  dx2dx\ 


^(VyA,  0)  dx2dx\  <  (7A2, 


(67) 


the  lower  bound  in  parentheses  being  part  of  our  hypotheses  above.  We  first  use  of  the  rigidity 
lemma  in  the  original  variables  to  restrict  the  rotation.  We  note  that  the  rigidity  lemma  (66) 
remains  true  also  with  R  replaced  by  RA\  and  SO(3)  concurrently  replaced  by  SO(3)Aa,  by  a 
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linear  change  of  variables.  Applying  this  lemma  we  therefore  have  the  existence  of  Ra  e  SO (3)  such 


that 


|VyA  —  RaAa|2  dx2dxi  <  C A2. 


(68) 


By  passing  to  a  subsequence  we  can  assume  Ra  — > ►  Ro-  In  view  of  the  uniform  Lipschitz  bound, 
—  f±  uniformly.  Thus,  we  can  assert  (68)  on  a  fixed  domain,  i.e., 


o  K+f-S 


IVja-RaAaI2  dx2dx i  <  C A2. 


-I  Jf++6 


(69) 


for  5  >  0.  Then  we  use  the  trace  theorem  applied  to  the  sub-boundary  S  =  {(x1,x2)  :  Xi  = 
0,  /+(0)  +  6  <  x2  <  (,  +  f~( 0)  —  fi}  to  deduce  that  there  is  a  constant  ca  such  that 


|yA  -  RaAax  -  cA|2dx2  <  C A2 


(70) 


Let  Xq  •  mA  =  0,  Xq  •  n  =  (1/2 )(/+  +  /  +C)n-m-L  and  let  B(x$,  r )  be  a  ball  of  radius  r  centered  at 
Xq.  Written  without  coordinates,  with  the  boundary  condition  yA(x)  =  x  inserted,  and  restricted 
to  a  subdomain,  (70)  is 


/ 

J  {x-m^=0}nB(xJ,r) 


RaAa)x  —  c\\2dA  <  CA2. 


Alternatively,  putting  Fa  =  I  —  RaAa, 


(71) 


|Fa(x  -  xj)  -  cA  +  FAxJ|2dA  <  C\ 


(72) 


^{x-mA=0  }nfi(x0A,r) 

Expanding  the  integrand  and  using  that  the  domain  of  integration  is  symmetric  about  Xq  we  get 


(I  -  RaAa)(x  -  Xq)  -  Ca  +  (I  -  RAAA)x^|2(iA 


'  (x-mA=0}nJB(x^,r) 


=  (F^Fa)  ■  [  (X  -  x0A)  ®  (x  -  Xq)  dx  +  7rr2|cA  -  FAxA|2  <  CA2.  (73) 

J  {x-mx=0}nB(xQ,r) 


Hence,  the  two  positive  semi-definite  terms  must  satisfy, 

|cA  -  FAxj)|2  <  C\2  and  (FAFA)  •  (I  -  mA  ®  mA)  <  CA2. 


(74) 


We  focus  on  the  second  of  these  and  recall  from  the  crystallographic  theory  that  Aa  =  I  —  AaA 
n  +  bA  ®  mA-  Using  that  (I  —  mA  ®  m^)m^  =  0  we  have  that 


(FaFa)  •  (I —  mA ® mA)  =  (I-RA-ARAaA®n)T(I-RA-ARAaA®n)-(I-mA®mA)  <  CA2.  (75) 


This  yields 

(I  -  Ra)T(I  -  Ra)  •  (I  -  mA  ®  mA)  <  CA2  +  2A  [(RA  -  I)aA  •  n  -  (mA  •  (RA  -  I)aA)(n  •  mA)] .  (76) 
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Noting  that  max|s|=1  |(Ra  —  I)s|2  =  2(1  —  cos6*a)  =  (1/2)  |Ra  —  I|2,  we  have  first  from  (76)  that 
|Ra  —  I|2  <  C A  and  then,  again  from  (76),  that 


|Ra  -  I|  <  C A. 

Using  again  the  form  of  A\  and  the  triangle  inequality, 

||VyA  —  A0||l2  <  ||VyA  —  RaAa||l2  +  ||(Ra  —  I)Aa||l2  +  ||Aa  —  A0||l2, 


(77) 

(78) 


we  have  that 


|VyA  -  A0 


dx2<lxi  <  C A2, 


(79) 


(or,  the  same  bound  with  A0  replaced  by  Aa).  In  summary,  the  rigidity  lemma  implies  that  a 
deformation  with  energy  less  than  ~  A2  is  actually  be  close  to  the  matrix  A0  in  L2,  the  quantitative 
bound  being  given  by  (79). 


8.4.3 


Bounds  on  the  positions  f*1  of  the  twin  boundaries  arising  from  energy  mini¬ 
mization 


We  have  assumed  that  0  <  f+  —  f  <  (.  Staying  for  now  with  the  original  variables  we  have 


o  rC+f'x 


|(VyA  -  AA)m-L|2  dx2d,x1  < 


o  r(+fx 


|VyA  —  Aa|2  dx2dx\  <  C A2. 


(80) 


(Recall  that  Vy  =  (l/m1  •  n)(y,i  Cg>m  +  y,2  ®n)  +  y,3  ®e.  An  application  of  Jensen’s  inequality 
(/a  Pdx  —  (l/(^  —  a))(/a  fdx)2)  t°  the  inner  integral  gives 


(VyA 


A\)m±  dx2\2dxi  <  C A2, 


(81) 


that  is, 


fC+fx 

/  (yA,2  -Aahi1)  dx2\2dx1  <  C A2. 

Jfi 


(82) 


Integrating  and  using  the  boundary  conditions  (in  the  original  variables,  i.e.,  using  the  boundary 
conditions  (54)),  we  have, 


(Ya>2  — A,mx)  dx2\2dxj 


fx  )( Ba  -  I)m|  +  (C  -  (A+ 


f\  ))(aa  “  I)mA  l2^xi  <  CA2. 


(83) 


Using  once  again  the  equations  of  the  crystallographic  theory,  we  have  (Aa  —  I)m^ 
and  (Ba  —  I)  nr1  =  (1  —  A)(n  •  mf  )aA-  Thus,  from  (83)  we  get 


f\  )  -  AC \2dxi  <  C A2. 


-A(n-mf)aA 


(84) 
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By  the  triangle  inequality,  ||/J  -  fx  ||1S  <  ||/jf  -  fx  -  A<||i»  +  || AC||i=  <  C A,  we  get 


|/+  -  ftfdXi  <  CX‘. 


'-i 


(85) 


This  can  be  combined  with  the  L°°  bound  of  the  derivative  by  using  the  interpolation  inequality 

(/a  =  ft  ~  f\  )> 


II/; 


a iil°°(r)  <  C'II/a|I12(r)||/aII|oo( 


<  C cs As, 


yielding  the  L°°  bound, 


\Jx  -  fX\  <  C\2/3. 


(86) 

(87) 


8.4.4  Lower  bound  of  the  energy 

We  continue  to  assume  that  y\,fx  is  a  low  energy  deformation,  i.e. ,  it  has  less  energy  than  the 
naive  test  function  (63),  and  we  seek  a  lower  bound  for  its  energy.  We  change  to  the  new  variables 
Ua(^i ,  tft)  and  define  =  { (ti ,  ^2)  :  —  t  <  t\  <  0,  0  <  t?  <  1}.  Let  va  =  yUA-  The  scaled  energy  of 
Ya,  fx  in  the  new  variables  is 

T of  ft  (X  +  A  (VA,1  -g\V\,2  )  <S>  A^TmA 
A  dne 

+  A  (aA  +  h\V\,2 )  <g>  A^Tn)  (C  -  (fx  ~  fft ))  dt idt2  <  C,  (88) 

and  we  seek  a  good  lower  bound  for  the  left  hand  side.  First  we  note  by  the  assumed  lower  bound 
</3(F)  >  C  dist2(F,  SO(3)),  the  L°°  bound  on  g\  (cf.,  (58),  and  the  fact  that  the  angle  between  the 
two  vectors  A^rnij  and  A^rn  remains  bounded  away  from  zero  in  the  limit  A  — >  0  (as  we  have 
assumed),  we  have  )| Va ||i^r  <  C  and  therefore,  for  a  subsequence  (not  relabeled),  we  have  that 
vA  v  in  Hl(Vtft). 

We  denote 

Gj  =  (vj,i  -gwx, 2 )  0  A((Tmi  +  (aA  +  hxv\, 2 )  0  A((Tn, 

Jx  =  (C-(/> T-AU).  (89) 

and  by  (87)  and  the  remarks  just  above,  we  have  Ja  — » ►  £  uniformly  and  Ga  — 1  G  in  L2((—£,  0)  x 
(0, 1)).  The  bound  (88)  becomes 


—  /  <p  (I  +  AGa)  J\  dt\dt2  <  C,  (90) 

A  Jne 

We  now  do  a  careful  Taylor  expansion  of  the  integrand  in  the  spirit  of  [12].  We  introduce  the  “rare” 
set  where  Ga  is  large,  i.e., 

Ex  =  {x  G  :  |Ga(x)|  >  A-^}  (91) 

and  we  note  that  on  Cl(\Ex,  we  have  A | Ga |  <  A1/2  —>  0.  Hence,  using  that  <p  G  C2  and  is  minimized 
at  I,  we  have  on  Clg\  Ex, 

(p(l  +  AGa)  =  2  ^^a)  +  <^(|AGa|)|AGa|2,  (92) 
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where  u>(s)  — >  0  as  s  — >  0.  Since  we  are  only  seeking  a  lower  bound  and  <p  >  0  we  simply  throw 
away  the  set  E\.  and  write, 


1 

A2 


Cp  (I  +  AGa)  J\  dt\dt2 


> 


A2 


i  av(i) 


2  OF2 


(AGa,  AGa)  +  w(|AGa|)|AG> 


J\dt\dt2 


ine\Ex 


1  d2(p(  I) 

2  3F2 


(Ga,  Ga)  +  cj(|AGa|)|G> 


J\dtidt2 


(93) 


Since  Ga  is  bounded  uniformly  in  L2(Q/)  and  cu(|AGa|)  — >■  0  uniformly,  then  the  error  term  can  be 
neglected: 


liminf-^-  [  <p  (I  4-  AGa)  J\  dtidt2  >  liminf  [ 

^  A2  Jch  A^O  Jat\Ex  L 


=  lim  inf 

A^O 


1  d2(p(  I) 

2  dF2 

1  d20(I) 

2  W2 


(Ga,  Ga) 


J\dt\dt2 


( Xne\Ex  Ga  ,  Xne\Ex  ga) 


J\dt\dt2. 


The  quantity  Xnt\Ex  converges  boundedly  almost  everywhere  to  xne  and  therefore  XnAEx  Ga  — *■  G 
(i.e.,  the  same  limit  as  previously)  in  L2(Q.f).  By  the  lower  bound  assumed  on  <p,  we  have  <92<p(I)/<9F2 
is  a  positive-definite  quadratic  form  on  symmetric  matrices,  that  is,  it  is  a  positive-definite  elasticity 
tensor.  Thus,  using  its  weak  lower  semicontinuity,  we  have  the  desired  lower  bound, 


lim  inf  — 

A^o  A2 


I T  AGa)  J\  dt^dt2 


'ld2<p{I) 

2  W2 


(G.G) 


Ctdt\dt2- 


(94) 


It  remains  to  identify  the  specific  form  of  the  limit  G.  By  the  definition  (89)  of  Ga  we  have 


(vA,i -3AVa,2)  ®  AArmA  +  (aA  +  hAVA,2)®  AATn  ^  G  in  L2(Qe).  (95) 


Since  h\  —  1/C,  uniformly,  the  only  term  whose  limit  is  not  completely  clear  is  g\V\,2.  This  term 
has  a  uniform  bound  in  L2(f^),  so  we  can  find  its  limit  by  testing  against  a  function  xp  G  Co°(f^) 
and  integrating  by  parts: 


gx^x^'f  dtxdt2 


f  t2fx  '  +  (1  ~  h  )ff 
I (it  C-(fx-fx) 

f  fx  ~  ft 
Jne  C  ~  (ft  ~  /a") 


■v\,2ip  dtidt2 


— vxipdt^-  gx^x^,2  dtidt2. 
J 


(96) 


We  claim  that  both  of  the  latter  integrals  tend  to  zero.  This  is  clearly  true  of  the  first  one,  as 
fx  ~  fx  is  bounded  in  L°°(f If)  and  therefore  has  a  weak*  limit;  this  limit  must  be  zero  since 
I/a  -  fx  \  This  combined  with  the  strong  convergence  va  — > ►  v  in  L2(Qe)  establishes  the 

assertion.  For  the  second  integral,  we  look  at  the  boundary  conditions  (61), 


AvA(ii,0)  =  fx(ti)(l  -  A)aA,  AvA(ti,  1)  = /A  (ti)(l  -  A)aA,  -i  <  h  <  0.  (97) 
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Since  by  the  trace  theorem  vA  is  bounded  in  L2(dQ(),  then  the  uniform  limits  of  f±,  and  therefore 
weak*  limits  of  both  individually  must  be  zero.  This  shows  that  the  second  integral  tends  to 
zero.  Summarizing,  we  have  that 


G  =  V,!  0 A0  Tm0  +  (a0  +  -v,2  ’ 


An  Tn. 


Note  that  play  no  role  in  the  lower  bound,  which  later  turns  out  to  be  the  T-limit.  The 
boundary  values  of  v  determine  the  values  of  /A 

Also,  since  vA  is  bounded  in  L2(dtte),  it  follows  from  the  boundary  condition  vA(— £,  t2)  = 
C(1  —  t2  +  rj\/X))(l  —  A)aA  that  (passing,  if  necessary,  to  a  subsequence), 


f  (99) 

We  conclude  that  there  are  several  vestiges  of  the  boundary  conditions  on  v  and  the  presence  of 
/±.  These  are:  (i)  v(ti,0)  ||  a0,  v(ti,  1)  ||  a0,  (ii)  (v(ti,0)  -  v(H,  1))  •  a0  >  0,  (iii)  v(0,t2)  =  0,  and 
(iv)  v(-f,  t2)  =  C(!  -  t2  +  fj) a0. 


8.4.5  Upper  bound 

For  the  upper  bound  in  the  T-convergence  argument,  we  have  to  show  that,  given  v  G  Hl  (O/) 
satisfying  the  boundary  conditions  v(ti,  0)  ||  a0,  v(£i,  1)  ||  a0,  (v(H,  0)  —  v(ti,  1))  •  a0  >  0,  v(0 ,t2)  = 
0,  and  v(— £,  t2)  =  C(1  —  h  +  fj)  a0  there  are  vA  G  Hl{£ l#),  f±  G  kF1,00(— £,  0)  (Lipschitz  constant  c), 
f+  >  f~ ,  and  Tjx  G  M  satisfying  the  boundary  conditions 

AvA(ti,0)  =  (ti)(l  —  A)aA,  AvA(ti,  1)  =  fx(h)(l  -  A)aA,  -l  <  tx  <  0, 
vA(0,  t2)  =  0,  vA(— £,  t2)  =  C  (1  +  ??a/A  —  t2)  (1  —  A)aA,  0  <  t2  <  1,  (100) 

such  that 

—  f  (p  (I  +  AGa)  J\  dtidt2  — » ►  f  —  ^  (G,  G)  (dtidi 

A2  Jck  Jne  2  OF 2 

where  GA,  G  are  calculated  from  vA,  v  in  the  usual  ways: 

Ga  =  (vA,i  -gAvA,2)  0  A“TmA  +  (a  +  hAvA,2)  0  A“Tn 
G  =  v,i  0Aq  Tm0  +  (a  +  ^v,2  )  0  Aq  Tn. 

If  v  G  C°°(Cle)  and  satisfies  the  boundary  conditions  v(fi,0)  ||  a0,  v(fi,l)  ||  a0,  and  (v(fi,0)  — 
v(ti,  1))  •  a0  >  0,  v(0,f2)  =  0,  and  v(— £,  t2)  =  £(1  —  t2  +  fj) a0  then  we  simply  take  vA  =  LAv, 
where  LA  G  M3x3  is  a  linear  transformation  satisfying  LAao  =  aA  and  LA  — >  I  as  A  — >  0,  and  we 
take  r]\  =  fj/X.  Due  to  the  differentiability  of  v  on  chb  we  have  for  A  sufficiently  small  that 
defined  by  (100)  is  Lipschitz  with  Lipschitz  constant  c.  Then  the  conclusion  (101)  follows  by  direct 
calculation  using  strong  convergence  and  classical  Taylor  expansion. 

If,  however,  we  only  know  that  v  G  and  satisfies  the  boundary  conditions,  then  there  is 

a  slight  problem  with  satisfying  the  Lipschitz  conditions  on  f±  obtained  from  the  (100).  However, 
in  this  case  we  first  approximate  v  by  G  C'°°(^),  — >  v  G  Hy{hl()  and  v^-1  satisfying  the 


(101) 


(102) 
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same  boundary  conditions  as  v.  Using  the  quadratic  growth  of  the  integrand  on  the  right  and  side 
of  (101),  we  have  that 


'ne  L 


1  d2<p(  I) 

2  <9F2 


(G(<5),G(<5)) 


C,dt\dt2 


L 


1  d2<p(I) 

2  <of2 


(G,G) 


(dt\dt2, 


(103) 


where  G^  =  v^,i  0AO  Tm0  +  (ao  +  ^v^,2  )  0  A0  Tn.  For  the  left  hand  side  of  (101)  we  now  choose 
va  =  Lav^,  A  <  A with  A5  >  0  for  S  >  0  chosen  small  enough  to  respect  the  Lipschitz  conditions. 
In  particular,  there  are  positive  sequences  A*,  =  A gk  — ^  0,  <5^  — ^  0  such  that  v\k  =  A^La^v^  satisfies 
the  Lipschitz  conditions.  By  Taylor  expansion,  strong  convergence  and  (103)  we  have  that  (101)  is 
satisfied  for  the  sequence  v\k . 


8.5  Summary  of  form  of  the  limiting  energy  of  the  transition  layer 


The  lower  bound  found  above  in  Subsection  8.4.4  shows  that  the  limiting  energy  of  a  low  energy 
sequence  parameterized  by  A  is  greater  than  or  equal  to 


1  d2(p{l) 

2  W2 


(G,G) 


C,dt\dt2, 


(104) 


with  G  replaced  by  v,i  0AoTmo  +  (ao  +  ^v,2 )  0  A0Tn.  We  show  in  Subsection  8.4.5  that  given 
a  function  v  G  H QfiQ  satisfying  the  limiting  boundary  conditions,  its  energy  (104)  can  be  ap¬ 
proximated  arbitrarily  closely  by  the  original  expression  for  the  energy  evaluated  on  an  appropriate 
sequence  va  satisfying  the  original  boundary  conditions.  These  two  statements  imply  that  the 
energy  (104)  is  the  T-limit  of  the  original  (A  >  0)  energy  under  its  boundary  conditions. 

In  particular,  it  follows  that  the  limit  of  the  family  of  minimizers  of  the  original  energy,  param¬ 
eterized  by  A,  is  a  minimizer  of  (104).  Also,  the  energy  of  this  family,  rescaled  by  dividing  by  A2, 
converges  to  the  energy  of  the  T-limit. 

The  full  statement  of  the  T-limiting  problem  is: 


where 


mm 

veA 


Me 


ld2£(I),  A_r  ,  1  .  A_r 

2  dF2  (v’1<g>A°  mo  +  (ao  +  -v,2)0Ao  n, 


1 


v,!  0AO  Tm0  +  (ao  +  -v,2  )  0  A0  Tn) 


Qdtidt2i 


(105) 


A=  {(v,ij)  €  ff'ff!,)  x  R  : 

Observe  that  f/,  which  describes 
the  minimization. 


v(U,  0)  ||  ao,  v(U,  1)  ||  ao,  (v(U,  0)  -  v(U,  1))  •  a0  >  0, 
v(0,  t2)  =  0,  v(  f .  t2)  -  C ( 1  -  t2  +  7))a„}.  (106) 

the  asymptotic  lowering  of  the  thin  twin  band,  takes  part  in 
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